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Abstract 

Proximity maps and regions are defined based on the relative allocation of points from two or more classes 
in an area of interest and are used to construct random graphs called proximity catch digraphs (PCDs) 
which have applications in various fields. The simplest of such maps is the spherical proximity map 
which maps a point from the class of interest to a disk centered at the same point with radius being the 
distance to the closest point from the other class in the region. The spherical proximity map gave rise to 
class cover catch digraph (CCCD) which was applied to pattern classification. Furthermore for uniform 
data on the real line, the exact and asymptotic distribution of the domination number of CCCDs were 
analytically available. In this article, we determine some appealing properties of the spherical proximity 
map in compact intervals on the real line and use these properties as a guideline for defining new proximity 
maps in higher dimensions. Delaunay triangulation is used to partition the region of interest in higher 
dimensions. Furthermore, we introduce the auxiliary tools used for the construction of the new proximity 
maps, as well as some related concepts that will be used in the investigation and comparison of them 
and the resulting graphs. We characterize the geometry invariance of PCDs for uniform data. We also 
provide some newly defined proximity maps in higher dimensions as illustrative examples. 

Keywords: class cover catch digraph (CCCD); Delaunay triangulation; domination number; proximity map; 
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1 Introduction 



Classification and clustering have received considerable attention in the statistical literature. In recent 
years, a new classification approach has been developed. This approach is based on the proximity maps that 
incorporate the relative positions of the data points from various classes. Proximity maps and the associated 
(di)graphs are used in disciplines where shape and structure are crucial. Examples include computer vision 
(dot patterns), image analysis, pattern recognition (prototype sel ection), geograph y and cartography, visual 
perception, biology, etc. Proximity graphs were first introduced bv lToussain^(jl980[) . who called them relative 
neighborhood graphs. The notion of relative neighborhood graph has been generalized in several directions 
and all of these graphs are now called proximity graphs. From a mathematical and algorithmic point of 
view, proximity graphs fall under the category of computational geometry. 

A general definition of proximity graphs is as follows: Let V be any finite or infinite set of points in W^. 
Each (unordered) pair of points {p,q) € V x V is associated with a neighborhood D^(p, g) C W^. Let *p be 
a property defined on 01 = {OT(p, g) : (p, g) G x V}. A proximity (or neighborhood) graph G(n_(p(y,i?) 
defined by the property Cp is a graph with the vertex set V and the edge set E such that (p, g) € i? iff Ol(p, q) 
satisfies property *p. Examples of most commonly used proximity graphs are the Delaunay tessellation, the 
boundary of the convex hull, the Gabriel graph, relative neighbor hood graph, Euclidean minimum spanning 
tree, and sphere of influence graph of a finite data set. See, e.g.. iJaromczvk and Toussaind |l992f) for more 
detail. The Delauna y tessellation 'D(V) of a finite set of points V , is the dual of the Voronoi diagram 
generated by V. See lOkabe et al.l (|200d) for further details. The convex hull of a set V C M'', denoted 
as Cff (y), is the intersection of all convex sets (there exists infinitely many of them) that contain V . The 
boundary oiCuiy) can be viewed as a proximity graph which is also a subgraph of 'DiV). The Gabriel graph 
of V, denoted as GGiV), is defined as the graph in which (p, q) is an edge of GG{V) iff the circle centered 
at the midpoint of the line segment pq and with diameter d{p, q) (the distance between p and q) does not 
contain any other points from V . The relative neighborhood graph of is a prominent representative of the 
family of graphs which are defined by some sort of neighborliness. For a set of points V C K'^, the relative 
neighborhood graph of V , denoted RNG{V), is a graph with vertex set V and edge set which are exactly the 
pairs {p, q) of points for which d{p, q) < min^gy max((i(p, v), d{q, v)). That is, (p, q) is an edge of RNG[V) 
iff Lune(p, q) does not contain any other points of V , w here Lune(iP, q) is defined a s the i ntersection of two 
discs centered at p, g each with radius d{p,q) (see, e.g., Ijaromczvk and ToussaintI (|l992f )V The Euclidean 
minimum spanning tree of V, denoted EMST(y), is defined as the spanning tree in which the sum of the 
Euclidean lengths of the edges yield the minimum over all spanning trees with vertex set V . The sphere of 
influence graph on V , denoted as SIG{V), has vertex set V and (p, q) as an edge iff the circles centered at p 
and q with radii min„gy\{p} d{p,v) and min,„gy\{^} d{q,v), respectively, have nonempty intersection. Note 
that EMST( V) is a subgraph of RNG{V) which is a subgraph of GG{V) which is a subgraph of V{V) (see 
Okabe et all tOOd) ). Furthermore, in the examples above, d{x, y), can be any distance in M.'^. Furthermore, 



the distance between a point x and a set A is defined as d{x, A) := infj^g^ d{x, y); and the distance between 
two sets A and B is defined as d{A, B) := inf(2. ^^g^xs '^(-'c, y). 

A digraph is a directed graph, i.e., a graph with directed edges from one vertex to another based on a 
binary relation. Then the pair {p,q) G V x V is an ordered pair and is an arc (directed edge) denoted as 
pq to reflect the differen ce between an arc and an edge. For example, the nearest neighbor (di)graph in 
Paterson and Yaol (|1992[ ) is a proximity digraph. The nearest neighbor digraph, denoted as NND(V), has 



the vertex set V and pq as an arc iff d{p, q) = min„gy\{p} d{p, v). That is, pq is an arc of NND(V) iff g is a 
nearest neighbor of p. Note that if pq is an arc in NND{V), then (p, q) is an edge in RNG{V). 

The proximity catch digraphs (PCDs) are based on the property *P that is determined by the following 
mapping which is defined in a more general space than K''. Let (fi, A4) be a measurable space. The proximity 
map N{-) is given hy N : fl ^ p{^)i where p(-) is the power set functional, and the proximity region of 
X G ^l, denoted as N{x), is the image oi x G under N{-). The points in N{x) are thought of as being 
"closer" t o X G ^ than are the points in 57 \ N{x). Proximity r naps are the building blocks of th e proximity 
graphs of Toussain^ ( 1980( ): an extensive survey is available bv I Jaromczvk and Toussaint ( 1992 ). The PCD 
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D has the vertex set V = {pi, . . . ,pn} and the arc set A is defined as piPj S ^ iff pj G N{pi) for i ^ j. 
Notice that the PCD D depends on the proximity map N{-), and if pj S N{pi), then N{pi) is said to catch 
Pj . Hence the name proximity catch digraph. If arcs of the form Pi Vj (i.e., loops) were allowed, D would have 
been called a pseudodigraph according to some authors (see, e.g., Chartrand and LesniakI ( 19961 )). 

The finite and asymptotic distribution of the domination number of CCCDs for uniform data in R are 
mathematically tractable. In this article, we determine some appealing properties of the proximity map 
associated with CCCD for data in a compact interval in M and use them as guidelines for defining new 
proximity maps in higher dimensions. As CCCD behaves nicely for uniform data in M (in the sense that 
the exact and asymptotic distributions of the domination number and relative arc density are available), by 
emulating its properties in higher dimensions we expect the new PCDs will behave similarly. Furthermore, 
we introduce some auxiliary tools used for the construction of the new proximity maps, as well as some 
related concepts that will be used in the investigation and comparison of the proximity maps. Additionally, 
we discuss the conditions for the geometry invariance for uniform data in triangles. 

We describe the data-random PCDs in Section [21 Voronoi diagrams and Delaunay tessellations in Sec- 
tion [3l the appealing properties of spherical proximity maps in M in Section IH transformations preserving 
uniformity on triangles in in Section [5l triangle centers in Section [51 vertex and edge regions in Section [3 
and proximity regions in Delaunay tessellations in Section [SI We present the results on relative arc density 
and the domination number of the new PCDs in Section [9l introduce two new proximity maps in Section 
[TUl and discussion and conclusions in Section [TTI 



2 Data-Random Proximity Catch Digraphs 



Priebe et al.l (|200lh introduced the class cover catch digraphs (CCCDs) and gave the exact and the asymp- 



totic distribution of the domination number of the CCCD based on two classes X„ and y„, both of 



which are random samples f r om uniform d i stribut i on on a compact inte rval i n R. DeVinnev et al. ( 20021) 



Marchette and Priebd |2003f) . [Priebe et all (|2003al ) . [Priebe et al.l (|2003bf) . and [DeVinnev and PrieQ (|2006l ) 
applied the concept in higher dimensions and demonstrated relatively good performance of CCCDs in clas- 
sification. The methods employed involve data reduction (condensing) by using approximate minimum 
dominating sets as prototype sets since findi ng the exact mini mum dominating set is in general an NP-hard 
problem — in particular, for CCCDs — (see DeVinnev ( 20031 )). Furthermore, the exact and the asymptotic 
distribution of the domination number of the CCCDs are not analytically tractable in dimensions greater 
than 1. 

Let Xn = {A'i, . . . , Xji} and 3^™ = { Yi, . . . , Ym} be two sets of ri-valucd random variables from classes 
X and y whose joint pdf is fx,Y- Let d{-,-) : ^lxil [0, oo) be a distance function. The class cover problem 
for a target class, say X^, refers to finding a collection of neighborhoods, N{Xi) around Xi G A'„ such that (i) 
'^n C (|J-7V(X,;)) and (ii) 3^m n(Ui -^(^0) = 0- A collection of neighborhoods satisfying both conditions 
is called a class cover. A cover satisfying condition (i) is a proper cover of class X while a collection 
satisfying condition (ii) is a pure cover relative to class y. From a practical point of view, for example 
for classification, of particular interest are the class covers satisfying both (i) and (ii) with the smallest 
collection of neighborhood s, i.e., minimum cardinality cove r. This class cover problem is a generalization 
of the set cover problem in Garfinkel and Nemhauserl ( 19721 ) that emerged in statistical pattern recognition 
and machine l earning, where an edited or condensed set (i.e., prototype set) is selected from Xn (see, e.g., 
Devrove et all ([l996)). 

In particular, the proximity regions are constructed using data sets from two classes. Given 3^,„ C f2, 
the proximity map Ny(-) : fl —f p(fl) associates a proximity region Ny{x) C Q with each point x € fl. The 
region Ny{x) is defined in terms of the distance between x and ym- More specifically, our proximity maps 
will be based on the rela tive position o f poin ts from class X with respect to the Delaunay tessellation of the 
points from class y. See Okabe et al. ( 2000l ) for more on Delaunay tessellations. 
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If Xn = {-'^1, ■ • ■ ,Xn} is a set of r2- valued random variables then Ny {Xi) are random sets. If Xi are 
independent identically distributed then so are the random sets Ny (Xi). The data-random PCD D — 
associated with Ny{-) — is defined with vertex set = {Xi, ■ ■ ■ ,Xn} and arc set A by XiXj E A <=^ 
Xj £ Ny (Xi). Since this relationship is not symmetric, a digraph is used rather than a graph. The random 
digraph D depends on the (joint) distribution of the Xi and on the map Ny{-). Let fi{Ny) := P{XiXj S 
A) = P{Xj G Ny [Xi)); so ^J.{Ny) is the probability of having an arc from Xi to Xj, hence is called arc 
probability for the PCD based on Ny. 

The PCDs are closely related to the proxim ity graphs of iJaromczvk and ToussaintI (Il992l) and might be 
considered as a special case of covering sets of Tuza (1994) and intersection digraphs of Sen et al. (|l989[ ). 
This data randoni proxim ity digraph is a vertex-random proximity digraph which is not of standard type (see, 
e.g., Janson et al. The randomness of the PCDs lies in the fact that the vertices are random with 

joint pdf fx,Y, but arcs XiXj are determin istic functions of the random variable Xj and the set Ny (Xi). For 
example, the CCCD of IPriebe et al. ( 2001 ) can be viewed as an example of PCD with Ny{x) = B{x, r(x)), 
where r{x) := miuygj;^ d{x, y). The CCCD is the digraph of order n with vertex set Xn and an arc from Xi 
to Xj iff Xj G B{Xi, r{Xi)). That is, there is an arc from Xi to Xj iff there exists an open ball centered at 
Xi which is "pure" (or contains no elements) of , and simultaneously contains (or "catches" ) point Xj . 

Notice that the CCCDs are defined with (open) balls only, whereas PCDs are not based on a particular 
geometric shape or a functional form; that is, PCDs admit Ny{-) to be any type of region, e.g., circle (ball), 
arc slice, triangle, a convex or nonconvex polygon, etc. In this sense, the PCDs are defined in a more general 
setting compared to CCCDs. On the other hand, the types of PCDs introduced in this article are well- 
defined for points restricted to the convex hull of y„i, Ch (3^m)- Moreover, the proximity maps introduced 
in this article will yield closed regions. Furthermore, the CCCDs based on balls use proximity regions that 
are defined by the obvious metric, while the PCDs do not necessarily require a metric, but some sort of 
dissimilarity measure only. 



3 Voronoi Diagrams and Delaunay Tessellations 



The proximity map in E defined as B{X,r(X)) where r(X) = miiiYey„ 

, d{X,Y) in ()Priebe et all (|200ll) ). 



Our next goal it to extend this idea to higher dimensions and investigate the properties of the associated 
digraph. Now let = {yi, . . . ,ym} C M and Yj;m be the j*'* order statistic. The above definition of the 
proximity map is based on the intervals Ij = (y(j-i):m, Yj;„i) for j — 0, . . . , (to + 1) with yo:m = — oo and 
y(m+i):m = oo- This intcrvalization can be viewed as a tessellation since it partitions C//(y„i), the convex 
hull of For d > 1, a natural tessellation that partitions Cniym) is the Delaunay tessellation, where 
each Delaunay cell is a (d + l)-simplex (e.g., a Delaunay cell is an interval for d = 1, a triangle for d = 2, 
a tetrahedron for d = 3, and so on.) Let Tj be the j*'' Delaunay cell for j = 1, . . . , J in the Delaunay 
tessellation of 3^,„. In M, the cell that contains x is implicitly used to define the proximity map. 

A tessellation is a partition of a space i nto convex polytop es; tessellation of the plane (into convex 



polygons) is the most frequently studied case (jSchoenberd (|2002[ )'). Given 2 < ti < oo distinct points in 



we associate all points in the space with the closest member (s) of the point set with respect to the Euclidean 
distance. The result is a tessellation of the plane into a set of regions associated with the n points. We call 
this tessellation the planar ordinary Voronoi diagram generated by the point set and the regions ordinary 
Voronoi polygons. See Figure[l]for an example with m = \ym\ = 10 3^ points iid from U[{0, 1) x (0, 1)). 

In general, let P = {pi,p2, . . . ,Pn} be n points in M.'^ where 2 < n < oo and pi ^ pj for i ^ j, 
i,j G [n] := {l, 2, . . . ,n} and let || • || denote the norm functional. We call the region Vc{Pi) ~ {.t £ M'' : 
1 1 — Pill < ||x — pjll for j i, i € [n]} the (ordinary) Voronoi polygon or cell associated with pi and the 
set 9J = {Vc(pi), ■ • . , Vc(Pn)} the Voronoi diagram or Dirichlet tessellation generated by P. We call pi 
the generator of Vcipi)- The Voronoi diagram partitions the space into disjoint regions (which are also 
called tiles or Thiessen polygons in R^). Notice that we still say 03 partitions the space M**, although Vc{Pj) 
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are not necessarily disjoint, but if nonempty the intersection lies in a lower dimension, or equivalently, has 
zero M^'-Lebesgue measure. We stick to this convention throughout the article. The intersection of two 
Voronoi cells, if nonempty, i.e., for i ^ j, Vc{Pi) H Vc{Pj) 7^ 0, is called a Voronoi edge. If a Voronoi edge 
is not a point, then VciPi) and Vcipj) are said to be adjacent. An end point of a Voronoi edge is called a 
Voronoi vertex. QJ is called degenerate if at a Voronoi vertex, more than three Voronoi polygons intersect, 
a nd non-degenerate o therwise. A detailed discussion including the history of Voronoi diagrams is available 



Okabe et all (|2000l ). 



Voronoi Diagram Dclaunay Triangulation 




0.0 0.2 0.4 0.6 O.B 1.0 0.0 0.2 0.4 0.6 O.B 1.0 



Figure 1: Depicted are 10 y points generated iid U{Q, 1) x (0, 1) (top), the corresponding Voronoi diagram 
(bottom left) and the Dclaunay triangulation (bottom right). 

Given a Voronoi diagram with n > d+l non-coplanar (i.e., not all the points lie on a (d— l)-dimcnsional 
hyperplane) generators, P, in R'', we join all pairs of generators whose Voronoi cells have a common Voronoi 
edge. The resulting tessellation is called the Delaunay tessellation of P. See Figure [1] for the Delaunay 
triangulation associated with the Voronoi diagram based on 10 points iid fromZ^((0, 1) x (0, 1)). By definition 
a Delaunay tessellation of a finite set of points, P, is the dual of the Voronoi diagram based on the same 
set. The tessellation yields a (unique) polytopization provided that no more than {d+l) points in M.'^ are 
cospherical (i.e., no more than {d + 1) points lie on the boundary of a (hyper)sphere in R''). Moreover, the 
circumsphere of each Delaunay polytope (i.e., the sphere that contains the vertices of the Delaunay polytope 
on its boundary) is pure from the set P; i.e., the interior of the circumsphere of the Delaunay polytope 
does not contain any points from P. The Delaunay tessellation partitions C g(P). In part i cular, in M^, the 
tessellation is a triangulation that yields triangles Tj, j = 1, . . . , J (see, e.g., Okabe et al.l (2000)) provided 



that no more than three points arc cocircular (i.e., no more than three points lie on the boundary of some 
circle in M^). In this article we adopt the convention that a triangle refers to the closed region bounded 



by its edges. See Figure [2] for two examples: an example with n = 200 X points ^'-^ ^((0, 1) x (0, 1)), the 
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Figure 2: A realization of 200 X points and the Delaunay triangulation based on 10 3^ points in Figure [T] 
(top). Another reahzation of 200 X and 20 y points and the Delaunay triangulation based on (bottom). 

uniform distribution on the unit square and the Delaunay triangulation based on the 10 3^ points in Figure 
[T] and an example with n = 200 X points ^((0, 1) x (0, 1)), the uniform distribution on the imit square 
and the Delaunay triangulation is based on 20 y points also U[{0, 1) x (0, 1)). 

3.1 Poisson Delaunay Triangles 

The Delaunay triangles are based on a given set of points 3^™. The set can be assumed to come from a 
Poisson point process on a finite region, and in the application of PCDs, to remove the conditioning on y^-, 
it is suggested that y^ comes from a Poisson point process for prospective research directions. We briefly 
describe the Poisson point processes and Poisson Delaunay triangles. 

A stochastic point process on R"* is defined to be a process in which points are generated according to a 
probability distribution P{\y n B| = fc) , fc = 0, 1, 2, . . ., over any B C M''. For example, a binomial point 
process is a stochastic point process in which n points are generated over a bounded set S' C M'^ according to 
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the uniform distribution. In particular, if d = 2 the process is called a planar stochastic point process. If two 
points coincide with probability zero, then it is a simple stochastic point process. A stochastic point process 
is said to be locally fi nite if any fin i te reg ion B C M'' contains a finite number of points with probability 1 



IS said to be locally p mte it any hn i te reg ie 
under the process (see lOkabe et al. ( 20001 )) 



We have built the Delaunay tessellation using with finite sample size. Suppose 3^ is from a stochastic 
point process. One of the most fundamental locally finite stochastic point processes is the Poisson point 
process, which is defined as the process that satisfies 

P(|3^nSHfc) = ^^l^^i^^ , A: = 0,l,2,... 

for any B C , where V(-) denotes the d-dimensional volume functional and A > is the intensity (number 
of points per unit volume) of the process. We can also define the Poisson point process as the limit of 
the binomial point process in the sense of expanding the finite region S to an infinite region while keeping 
A = n/\{S) constant. We call the Delaunay tessellation based on a finite data set from a Poisson point 
process Poisson Delaunay tessellation and denote it 'Dp. The associated Voronoi diagram is called the 



Poisso n Voronoi diagram and is denoted by Vp. For more detail on the properties of Vp, see, lOkabe et al 
( 2000h . 



A simplex in is the convex hull of any {d+ 1) points in general position, i.e., no 1) of the points lie 
in a (d— l)-dimensional hyperplane in R'^. The simplex is the point itself for d = 0, the line segment joining 
the two points for d = 1, a triangle for d = 2, a tetrahedron for d = 3, and so on. Each Poisson Delaunay 
cell of Dp is a (d + l)-dimensional simplex whose vertices are xq, xi, . . . , Xm from the Poisson point process. 
Any s-face of Dp is an (s + I)-dimensional simplex with vertices xq, . . . ,Xs, also points from the Poisson 

point process. There are f ^ ) many s-faces contained in a Poisson Delaunay cell for < s < d. 



s + 1, 

Let c and r be the circumcenter and circumradius, respectively, of a (d+ l)-dimensional Poisson Delaunay 
cell in M'*. Then the (d + 1) vertices of the cell are the points {c + r ui} where {ui} are the unit vectors for 
z = 0, 1, . . . , d. The ergodic joint probability density function (pdf) of Dp is completely specified as 

fir,uo, ...,Ud) = a{\,d) Aar'^'"'^ exp [-Xwdr'^) 

where is the volume of the (d + l)-simplex with vertices uq, ■ . ■ , Ud, Wd = 7r'*''^/r(d/2 + 1) and 

^(d^+i)/2 p(,^2 [2 A r((d + l)/2)] 



a(A, d) 



d'i-2r(d/2)2'i+lr((d2 + l)/2) 



The circumradius r may be viewed as a measure of size of the (d + l)-simplex, and is independent of 
{uo, . . . , Md}. The pdf of r is a generalized gamma function with t = d, q = d^ and h ^ \d, where a 
3-parameter generalized gamma function is 

/(x) = r6«/*x«-i exp(-5x*)/r(g(t)) for x,t,p,q> 0. 

Let Vd denote the volume of a typical Poisson Delaunay cell. Then for d = 2, the expected value of the fc*'* 
moment of volume of a typical Poisson Delaunay triangle is 

E[y,1= r((3fc + 5)/2)r(fc/2 + i) 

3r((fc + 3)/2) 2fc7rfc-i/2Afc 
In R2, the joint pdf of a pair of inner angles arbitrarily selected from an arbitrary triangle in Pp is given 

by 

g 

f{x, y) — — (sin x) (sin y) sin(a; + y), for a;, y > and x + y < ir. 
on 
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Notice that the mode of this density is at a; = y = 7r/3, which impUcs that the most frequent triangles in a 
2?P are nearly equilateral triangles. By integrating over y, we obtain the pdf of a randomly selected inner 
angle of an arbitrary triangle from Dp: 



— ( (tt — x) cos X + sin x) sin x 



1(0 < x < tt) 



where I(-) is the indicator function. Then the expected value of X is E [X] — 7r/3 and E [X^] = 2 7r^/9 — 5/6. 



Mardia et al.l (|l977t ) derived the pdf of the minimum angle as 



/i (x) = — (tt — 3x) sin 2x + cos 2x — cos 4a; ) 1 1 < x < — 



and iBootd (|l986[ ) gave the pdf of maximum angle, 
^2 



- (3a;(sin 2x) — cos 2x + cos 4a; — tt sin 2a;) 



V3 



< a; < 



(47r(cosx)(sina;) + 3sina;^ — cosa;^ — 4a;(cosa;)(sina;) + l) 



I I - < a; < TT 



The distribution of the length of an arbitrary edge e of an arbitrary triangle from Dp is 



fhix) = ( \/Aa;exp ( — ttA— ) + crfc (VttA— 



x2 



where erfc is the complimentary error function erfc(z) = --j= j e dt = tt 2r[-,z] (jOkabe et al 



4 The Appealing Properties of Spherical Proximity Maps in M 

The CCCDs have desirable properties such as having the finite sample and asymptotic distributions of the 
domination number available. In this section, we determine some appealing properties of the proximity map 
associated with CCCD for uniform data in a compact interval in R and use them as guidelines for defining 
new proximity maps in higher dimensions. We believe these properties cause the CCCD to behave so "nicely" 
in R and the more they are satisfied by the new PCDs in higher dimensions, the more likely the new PCDs 
to have similar behaviour. Furthermore, we introduce the auxiliary tools used for the construction of the 
new proximity maps, as well as some related concepts that will be used in the investigation and comparison 
of the proximity maps. 

Let ym = {yi, ■ • ■ lYm} C M. Then the proximity map associated with CCCD is defined as the open 
ball N six) := B{x,r{x)) for all x G K\3^m, where r{x) = miuygj;^^ d(a;, y) (see Section and IPriebe et al 
(|200ll) ) with d{x,y) be ing the Euclidean distance between x and y. For x E define Ns{x) = {x}. Notice 
that a ball is a sphere in higher dimensions, hence the name spherical proximity map and the notation Ns- 
Furthermore, dependence on ym is through r{x). Note that, this proximity map is based on the intervals 
li = (y(i-i):m,yi:m) for i = 0, . . . , (m + 1) with yo:m = -oo and y(m+i):m = oo where yi:m is the i*'^ order 
statistic of y^n ■ 

For Xi without loss of generality we can assume = (0, 1). Then the arc probability fJ-iNg) = 

1-1/2 1-2x1 pi 1-1 

P{X2 G Ns{Xi)) = 1/2, since P{X2 G Ns{Xi)) = / / dx2dxi + / dxadxi ^ 1/2. 

Jo Jo Jl/2 J2-2xi 
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A natural extension of the proximity region Ns{x) to multiple dimensions (i.e., to R'' with d > 1) is 
obtained by the same definition as above; that is, Ns{x) := B(x,r{x)) where r{x) := Toiny^y^ d{x,y). The 
spherical proximity map Ns{x) is well-defined for all a; G K'^ provided that 3^™ ^ 0. Extensions to and 
hi gher dimensions with th e spherical proximity map — w i th application s in clas si fication — ar e invest igated 
DeVinnev ela P (l200 ^Marchette and Priebd (|2003l ). IPriebe et al.1 (|2003al) . IPriebe et all (|2003b[ ). and 



DeVinnev and Priebc (2006f ). However, finding t he minimum dominating set of the PCD associated with 



Ns{-) is an NP-hard problem dPeVinnevI (l2003l) ) and the distribution of the domination number is not 



analytically tractable for d > 1 ()Cevhanl (|2004f )). This drawback has motivated the definition of new types 



of proximity maps in higher dimensions. Note that for d = 1, such problems do not exist. We state some 
appealing properties of the proximity map Ns{x) = B{x,r{x)) in M and use them as guidelines for our 
definition of new proximity maps: 

PI Ns{x) is well-defined for all x e Ch (3^,™) = [yi:rn,ym:m]. 

P2 a; e Nsix) for aU xeCn (3^m). 

P3 The point x is at the center of Ns{x) for all x E Ch (3^™). 

P4 For X E li Q Ch (3^m), Ns{x) and It are of the same type; i.e., they are both intervals. 

P5 For X E li Q Ch {y■m)^ Ns{x) mimics the shape of /,;; i.e., it is (geometrically) similar to 7^. 

P6 For X e li, Ns{x) is a proper subset of It for all x E Ii \ {(y(i_i):m -I- yi:m)/2} (or almost everywhere 
in li). 

P7 For X £ li and y E Ij with i j, Ns{x) and Ns{y) are disjoint regions. 

P8 The size (i.e., measure) of Ns{x) is continuous in x; that is, for each e > there exists a (5(e) > such 
that ||A^5(?/)| - |iVs(a;)|| < e whenever \d{x,y)\ < 5{e). 

P9 The arc probability fJ-{Ns) does not depend on the support interval for imiform data in M. 

Notice that properties PI, P2, and P3 also hold for all x E K. P9 implies that not only the arc 
probability but also the distribution of the relative arc density and domination number do not depend on 
the support interval either. This independence of the support set is called geometry invariance in higher 
dimensions (see Section[5]). For Ns it suffices to work with U{0, 1) data, and in higher dimensions we will be 
able to consider only uniform data in an equilateral triangle for PCDs based on proximity maps that satisfy 
P9. 

Suppose we partition the convex hull of y,n, Ch (3^m) by Delaunay tessellation. Let Tj be the j*'* Delaunay 
cell in the Delaunay tessellation of 3^™ for j = 1, . . . ,J, where J is the total number of Delaunay cells. See 
Figure [2] for two y sets of sizes 10 and 20 and the corresponding Delaunay triangulations. 

Note that P4 and P5 are equivalent when d = 1 for .t e Ch (ym), since any two (compact) intervals in R 
are (geometrically) similar. For d > f , P5 implies P4 only, since, for example, for d = 2, any two triangles 
are not necessarily similar, but similar triangles are always of the same type; they are triangles. 

Notice that Ns{-) satisfies only PI, P2, P3, and P8 in with d > 1. P4 and P5 fail since Ns{x) is 
a sphere for x ^ ym, but Tj is a (d -I- l)-simplex. For any x E Tj C R'', B{x,r{x)) (fi Tj, so P6 also fails, 
furthermore this also implies that Ns{x) and Ns{y) might overlap for x,y from two distinct cells, hence P7 
is violated. The arc probability n{Ns) depends on the support set % for d > 1 so P9 is violated. 

The appealing properties mentioned above can be extended to more general measurable spaces. Let 
be a measurable space, and let ^Ij, j E {1, 2, ... , J} partition fi, and /i be the associated measure 
on f2. Then the appealing properties are 

PI N{x) is well-defined for all x E i^. 
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P2 X e N{x) for all x e n. 

P3 X is at the center of N{x) for all .t e O. 

P4 For x G N{x) and arc of the same type; they have the same functional form. 

P5 For X G r2j C il, A^(a;) mimics the shape of flj] i.e., it is similar to Qj. 

P6 For X G ilj, N{X) is a proper subset of ilj a.s. 

P7 For X € f2j and y ilk with 7^ fc, A^(x) and N{y) are disjoint. 

P8 The measure of N{x) is continuous in x; that is, for each e > there exists a 6{e) > such that 
IK^iy)) - < '5(e) whenever ||y - a;|| < e. 

P9 The arc probability ni^s) does not depend on the support set for uniform data in fl. 

Property P6 suggests a new concept. 

Definition 4.1. The superset region for any proximity map iV(-) in ft is defined to be ^%s{N) := {x S : 
N{x) = Vl). □ 

For example, for = C R, ^s{Ns) ■= {x e h : Ns{x) = /J = {(y(,_i):,„ + y^■.„^) /2}, and for ^ = 
% C W^, ^s{Ns) := {x€%: Nsix) = Ti}. Note that for x G /„ XiNsix)) < X{I,) and X{Ns{x)) = A(/j) iff 
X G ^s{Ns) where A(-) is the Lebesgue measure on R (also called R-Lebesgue measure). So the proximity 
region of a point in ^s(A^s) has the largest R-Lebesgue measure. Note that for ~ {yi, . . . ,ym} C M, 

^siNs) ^ { XiZ^, . . . , yi-^y--+y--y ,,,, ,hat given y^, ^.(iV,) is not a random set, but 

1{X G ^s{Ns)) is a random variable. Furthermore, P6 is equivalent to ^s{Ns) having zero R-Lebesgue 
measure. On the other hand, for x G d{Ii) = {y(i-i):m, Yili the proximity region Ns{x) = {x} has zero 
R-Lebesgue measure. This suggests the following concept. 

Definition 4.2. Let be a measurable space. The Ao-region for any proximity map N{-) is defined to 

be Ao(A^) -.^ {xen-. n{N{x)) = O}. □ 

For n = R'', Ao(A^s) := {x G R^' : X{Nsix)) = O}. For example, for n = Ch (Xn) £ R, Ao(A^s) = 3^m, 
since A(iVs(x)) = iff x G 3^™. 

Furthermore, given a set B of size ri in [yi:m,ym:m] \ 3^m, P7 implies that the number of disconnected 
components in the PCD based on Ns{-) is at least the cardinality of {i G [m] : _B n 7^ 0}, which is the set 
of indices of the intervals that contain some point(s) from B and [m] :— {0, 1, . . . , m — l}. 

5 Transformations Preserving Uniformity on Triangles in 

The property P9 suggests that in higher dimensions the arc probability of PCDs based on uniform data is 
geometry invariant, i.e., does not depend on the geometry of the support set. The set Xn is assumed to be 
a set of iid uniform random variables on the convex hull of 3^™; i-C, a random sample from U {Ch (3^m))- In 
particular, conditional on |A^„nTj| > being fixed, X„nTj will also be a set of iid uniform random variables 
on Tj for j G {1, 2, . . . , J}. The geometry invariance property will reduce the triangle Tj as much as possible 
while preserving uniformity and the probabilities related to PCDs will simplify in notation and calculations. 
Below, we present such a transformation that reduces a single triangle to the standard equilateral triangle 
Te = T((0,0),(l,0),(l/2,V3/2)). 
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Let 3^3 = {yi, y2, Ya} C be three non-coUinear points and T (3^3) be the triangle with vertices yi, y2, Ya- 



Let Xi - U{T (3^3)), the miiform distribution on T (3^3), for j = 1, . . . , n. The pdf oiU{T (3^3)) is 



/(") 



1 



^(^(3^3)) 



I(«er(y3)), 



where A{-) is the area functional. 

The triangle r(3'3) can be carried into the first quadrant by a composition of transformations in such 
a way that the largest edge has unit length and lies on the x-axis, and the x-coordinatc of the vertex 
nonadjacent to largest edge is less than 1/2. We call the resultant triangle the basic triangle and denote 
it as n where Tb = ((0,0), (1,0), (ci,C2)) with < ci < 1/2, and C2 > and (1 - cif + cj < I. We will 
describe such transformations below: Let be the edge opposite to the vertex y^ for i S {1, 2, 3}. Find the 
lengths of the edges; say 63 is of maximum length. Then scale the triangle so that 63 is of unit length. Next 
translate yi to (0,0), and rotate (if necessary) the triangle so that y2 = (1,0). If the y-coordinate of y3 is 
negative reflect the triangle around the x-axis, then if a;-coordinate of y3 is greater than 1/2, reflect it around 
X ~ 1/2, then the associated basic triangle Ti, is obtained. So the basic triangle can be obtained by a 
transformation denoted by which is a composition of rigid motion transformations (namely translation, 
rotation, and reflection) and scaling. Hence if 2^(3^3) is transformed into Th, then r(3^3) is similar to Tf, 

and 06 {T (y^)) = Tf,. Thus the random variables Xi U{T (y^)) transformed along with T (y^) in the 

described fashion by (pj, satisfy (/)b{Xi) U(Tb). So, without loss of generality, we can assume T (3^3) to be 
the basic triangle. The functional form of Tt, is 

Tb = {{x, y) e : 2/ > 0; 2/ < (c2 x)/ci; y < C2 (1 - x)/{l - ci)} . 



If Cl = 1/2 and C2 = \/3/2, then Tb is an equilateral triangle; if C2 < \/ci — c\, then Tb is an obtuse triangle; 
if C2 = \/ci — cf , then Tb is a right triangle; and if C2 > \Jci~ c\, then Tb is an acute triangle. If C2 = 0, 
then the Tb reduces to the unit interval (0, 1). See Figure [3] for the domain of (ci, C2) for Tf, and the part on 
which Tb is a non-acute triangle. 




Figure 3: The shaded regions are the domain of (ci,C2) values for the basic triangle Tb (left) the domain 
where Tb is a non-acute triangle (right). 



Lemma 5.1. The arc probability fj.{Ny) of the PCD based on Ny for uniform data on T (y^) is rigid-motion 
and scale invariant; i.e., fj,{Ny) does not change under rigid motion transformations and does not depend 
on the scale of the support triangle T (3^3). 

Proof: We have shown that for X, - U{T (3^3)), we have (j)b{Xi) ~ U{Tb) since T (3^3) is similar to Tfc. For 
uniform data, the set probabilities are calculated as the ratio of the area of the set to the total area. So 
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p{x esc Tiy^)) ^ A{S)/A{T{y3)) and P{M^) e MS) c MTiy^))) = p{MX) e MS) c n) = 

A{MS))/A{Tb) = [kA{S)]/[kA{T {y3,))] = A{S) / A{T {y^)) where k is the scahng factor. Letting X = Xj 
and S = Ny{Xi), the desired resuh follows. ■ 

Based on Lemma |5. 11 for uniform data we can, without loss of generality, assume T (3^3) to be the basic 
triangle Tb. 



5.1 Transformation of Tt, to Tp 



There are also transformations that preserve uniformity of the random variable, but not similarity of the 
triangles. We only describe the transformation that maps T (y^) to the standard equilateral triangle, = 
T ((0,0), (1,0), (1/2, a/3/2)) for exploiting the symmetry in calculations using Tg. 

Let (pe : {x,y) — > (w, w), where u{x,y) = a; H 7=~~y ^^-^ v{x,y) = -^—y- Then yi is mapped to 

V3 2 C2 

(0,0), y2 is mapped to (1,0), and ys is mapped to (l/2, a/3/2). See also FigureH) Note that the inverse 
transformation is M^(u, 
the Jacobian is given by 

J{x,y) 



transformation is (p^ ^(u,v) = (x{u,v),y{u,v)^ where x{u,v) ~ u — - — — !^ w and y{u,v) = —^u. Then 



dx 
du 


dx 
dv 




1 


2 ci-1 
V3 


dy_ 
du 


dy 
dv 







2C2 

V3 



2C2 

V3' 



So fu,v{u,v) = fx,Y{4'e ^iu,v)) \J\ ~ -^l((u,w) e Te). Hcncc uniformity is preserved. 




Figure 4: The description of (j)e{x,y) for {x,y) € (left) and the equilateral triangle (peiTt) ~ (right). 



Theorem 5.2. The arc probability ^{Ny) of the PCD based on Ny for uniform data on Tb is geometry 
invariant iff A{(j)eiNy{x))) — A(N^^Q;^^((j)e{x))) for all x G Tb- 

Proof: By Lemma [5Tl the PCD based on Ny for uniform data on T (3^3) is rigid-motion and scale invariant. 
So T (y^) can be transformed to Tb preserving the uniformity of the data and the arc probability for the 
associated PCD. For uniform data, the set probabilities are calculated as the ratio of the area of the set 
to the total area. Suppose the arc probability is geometry invariant. Then fi{Ny) = P(X £ Ny(x)) = 
PiMX) e N^^^y^^iMx))). But PiX g Ny{x)) = A{Nyix))/AiTb) and P{MX) G N^^^y^^iM^))) = 
^(^0e(3^3)(^e(a;)))/A(Te). Moreover A{Ny (x)) / Ain) = Ai^Ny (x))) / A{MTb)) = A{MNy {x))) / AiT,) 
since the Jacobian cancels out and cj)e{Tb) = T^. Hence A{N^^(^y^){(l}eix))) /A{Te) = A{(l)e{Ny{x)))/A{Te) 
implies A{(j)e{Ny{x))) = A{Nff,^(^y^'j{(j)e{x))) for all x £ Tb- The converse can be proved similarly. ■ 
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Corollary 5.3. If (f)e{Ny{x)) ~ N^^^y^-j{(j)e{x)) for all x G Ti,, then the arc probability n{Ny) of the PCD 
based on Ny for uniform data on Tf, is geometry invariant. 

Proof: Let x e Tb. Then (j)e{Ny{x)) = N^^jy.^){4>e{x)) impUcs A{(t)e{Ny{x))) = A{N^^(y^^{(t)e{x))). Hence 
the result follows by Theorem 15.21 ■ 

Definition 5.4. The A/-cdge regions are said to be geometry invariant if (/)e(i?M(ei)) = R^^i^M){4'e{^i)) 
for i = 1,2,3. The M-vertex regions are said to be geometry invariant if (peiRAdiVi)) = R<j,^{M){4>e{yi)) for 
i = 1,2,3. □ 

Corollary 5.5. Suppose Ny is based on geometry invariant edge or vertex regions. If the proximity regions 
are based on boundary of T (y^) and parallel lines to edges, then geometry invariance of the arc probability 
for uniform data follows. 

Proof: Such proximity maps with geometry invariant edge or vertex regions, satisfy (f)e{Ny(x)) = N^^(^y^^((f)e{x)). 
Hence the desired result follows by Corollarv l5.3l ■ 

Corollary 5.6. If the edge or vertex regions are based on specific angles in Th in the sense that their vertices 
have specific angular values, then these regions are not geometry invariant. Similarly if the proximity regions 
are based on specific angles in Tb then they are not geometry invariant either. 

Proof: The transformation (j)^ clearly does not preserve the angles in Tb. Hence the regions dependent on 
(inner) angles of Tb fail to be preserved. ■ 



6 Triangle Centers 

The PCDs will be defined using the vertex and edge regions, which will be constructed using a point, 
preferably, in the interior of the triangle, e.g., a triangle center. Let 3^3 = {vi, 72,73} C be non- 
collinear and T (y^) be the corresponding triangle. The trilinear coordinates of a point P with respect 
to T (y^) are an ordered triple of numbers, which are proportional to the distances from P to the edges. 
Trilinear coordinates are denoted as (a : /3 : 7) and also are kno wn as homogeneou s coordinates or trilinears. 



Trilinear coordinates were introduced by Pliicker in 1835 (see IWeissteinI (|2008f )). The triplet of trilinear 
coordinates obtained by multiplying a given triplet by any positive constant k describes the same point; i.e., 
(a : (3 : j) — {ka : k(3 : ^7), for any fc > 0. By convention, the three ver tices Vi, y 2 , and ya of r(3^3) are 



commonly written as (1 : : 0), (0:1: 0), and (0:0: 1), respectively fsee IWeissteinI (|200 



Definition 6.1. A triangle center is a point whose trilinear coordinates are defined in terms of the edge 
lengths and (inner) angles of a triangle. The function giving the coordinates (a : /? : 7) is called the triangle 
center function. □ 



Kimberlind ( 20081 ) enumerates 360 triangle centers, among which four have been widely known since the 



ancient times; namely, circumcenter (CC), incenter (IC), center of mass or centroid (CM), and orthocenter 
(OC). The point where the center is located in T (3^3) will be labeled accordingly; e.g., Mcc will denote the 
circumcenter of T (3^3). 

The circumcircle is a triangle's circumscribed circle; i.e., the unique circle that passes through each of 
the triangle's three vertices, yi, y2, y3. The center of the circumcircle is called the circumcenter, denoted as 
Mcc, and the circle's radius is called the circumradius, denoted as rcc- By construction, the distances from 
circumcenter to the vertices are equal (to rcc)- Furthermore, the triangle's three edge bisectors perpendicular 
to edges at Mi for i £ {1,2,3} intersect at Mcc- See Figure [51 The trilinear coordinates of Mcc are 
(cos 6*1 : cos 02 '- cos ^3) where 6i is the inner angle of T (3^3) at vertex y^ for i € {1,2, 3} and the trilinears for 
Mcc can also be written as (rcc cos 0i : ^cc cos ^2 : Tcc cos ^3). 
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The circumcenter of a triangle is in the interior, at the midpoint of the hypotenuse, or in the exterior 
of the triangle, if the triangle is acute, right, or obtuse, respectively. See Figure [5] where an acute and an 
obtuse triangle ar e depicted. U s ing th e pdf of an arbitrary angle of a triangle Tj from Poisson Delaunay 
triangulation Dp ( Mardia et al.l ( 1977( )). we see that. 



P {T.J is a right triangle) = P{e = 7r/2) = 0, 
hence P{Mcc is the midpoint of the hypothenuse) = 0. Furthermore, 



P {Tj is an obtuse triangle) 



P{Mcc ^Tj) = P{9n,,^>Tr/2) = / h{x)dx 

Jtt/2 



(3/s(V2¥)-/c(V2¥)-3/s 



where 



.03726 



- (3a:: (sin 2x) — cos 2x + cos Ax — tt sin 2x) 



12^ 



I(7r/3 < .T < 7r/2)) + 



(47r(cos a:;)(sin a;) + 3sina;^ — cosx'^ — 4x(cosa;)(sina;) + 1) 



I(7r/2 < a; < tt) 



is the pdf of the maximum angle. fc{x) ~ / cos(7r 

Jo 



t^/2)dt, and fs{x) 



I sin(7r t^/2)(it arc the Fresnel co- 
Jo 



sine and sine functions, respectively. The coordinates of Mcc ij^i the basic triangle Tf, are 



1 cf-ci 



2C2 




Figure 5: The circumcenter, circumcircle, and circumradius of an acute triangle (left), an obtuse triangle 
(right). 

The incircle is the inscribed circle of a triangle, i.e., the unique circle that is tangent to the edges Cj at 
Pi for i G {1,2,3}. The center of the incircle is called the incenter, denoted as Mj, and the radius of the 
incircle is called the inradius, denoted as r^c- Incenter has trilinear coordinates (1:1:1). The incenter is 
the point where the triangle's inner angle bisectors meet. See Figure [S] (left). 

The coordinates of Mi for the basic triangle Tf, are (x^c, Vic), where 



ci - Vc? + 4 



1 + y/cl +4 + y/{l - Ci)2 + d 



C2 
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Note that. Mcc and Mj do not necessarily concur. The distance between Mcc and Mj is d{Mcc, Mj) = 
\/ reduce ~'2ric). Unhke the circumcenter, the incenter is guaranteed to be inside the triangle. 

The median line of a triangle is the line from one of its vertices to the midpoint of the opposite edge. 
The three median lines of any triangle intersect at the triangle's centroid, denoted as Mc- The centroid 
is the center of mass of the vertices of a triangle. Since Mq is also the intersection of the triangle's three 
median lines, it is sometimes called the median point. It has trilinear coordinates (l/|ei| : l/|e2| : l/lcsl) 
(esc 6*1 : esc 01 : csc^i) where denotes the edge opposite to the vertex for i G {1,2,3}. The centroid is 
also guaranteed to be in the interior of the triangle. See Figure [6] (right). The coordinates of Mc for the 
basic triangle are ((1 + ci)/3, C2/3). 




Figure 6: The incircle, incenter, inradius of a triangle (left), and the centroid or center of mass of a triangle 
(right). 

The intersection of the three altitudes of a triangle is called the orthocenter, Mq, which has trilinear 
coordinates (cos ^2 cos 6*3 : cos^icos^a : cos0icos02)- The orthocenter of a triangle is in the interior, at 
vertex ys, or in the exterior of the basic triangle, Tb, if Tf, is acute, right, or obtuse, respectively. The 
functional form of Mq in the basic triangle is (ci, ci (1 — ci)/c2). 

Note that in an equilateral triangle, Mj — Mcc = = Mc (i.e., all four centers we have described 
coincide). 

7 Vertex and Edge Regions 

The new proximity maps will be based on the Delaunay cell Tj that contains x. The region Ny{x) will also 
depend on the location of x in Tj with respect to the vertices or faces (edges in M^) of Tj. Hence for Ny{x) 
to be well-defined, the vertex or face of Tj associated with x should be uniquely determined. This will give 
rise to two new concepts: vertex regions and face regions {edge regions in R^). 

7.1 Vertex Regions 

Let 3^3 = {yi,y2,y3} be three non-collinear points in and r(3^3) ~ r(yi,y2,y3) be the triangle with 
vertices 3^3. Then for x G T (y^), Ns{x) = B{x,r{x)) where r{x) = miuyg^yg d{x,y). That is, r{x) = d{x,yi) 
iff a; G VciVi) H r(3^3) for i G {1,2,3}, where Vc(yi) is the Voronoi cell generated by y^ in the Voronoi 
tessellation based on 3^3. Notice that these cells partition the triangle T {y^) and each VciVi) n T {y^) is 
adjacent only to vertex y^ and their intersection is the point M which is equidistant to the vertices, so M is 
in fact the circumcenter, Mcc, of T (3^3). See Figure [71 To define new proximity regions based on some sort 
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of distance or dissimilarity relative to the vertices 3^3, we associate each point in T (3^3) to a vertex of T (3^3) 
as in the spherical case. This gives rise to the concept of vertex regions. Note that Ns{x) is constructed 
using the vertex region based on the closest vertex, argminyg-y^ d{x,y). If two vertices were equidistant from 
X (i.e., argmiUyg-y^ d{x,y) were not unique), x is arbitrarily assigned to a vertex region. In fact, for Ns, by 
construction, it would not matter which vertex to pick when the vertices are equidistant to x, the region 
Ns{x) will be the same. 

Definition 7.1. The connected regions that partition the triangle, T (3^3), (in the sense that the intersections 
of the regions have zero R^-Lebesgue measure) such that each region has one and only one vertex of T (3^3) 
on its boundary are called vertex regions. □ 

This definition implies that there are three vertex regions. In fact, the vertex regions can be described 
starting with a point M S \ 3^3. Join the point M to a point on each edge by a curve such that the 
resultant regions satisfy the above definition. Such regions are called M -vertex regions and we denote the 
vertex region associated with vertex y as Rm{)/) for y £ 3^3. In particular, one can use a center of the triangle 
T {y^) as the starting point M for vertex regions. See the discussion of triangle centers in Section [51 The 
points in i?M(y) can be thought as being "closer" to y than to the other vertices. 

It is reasonable to require that the area of the region i?A/(y) gets larger as d{M,y) increases. Usually 
the curves will be taken to be lines or even the orthogonal projections to the edges. But these lines do not 
necessarily yield three vertex regions for M in the exterior of T(3^3). Unless stated otherwise, M- vertex 
regions will refer to regions constructed by joining AI to the edges with straight line segments. 



7.1.1 M- Vertex Regions 

For M <£ T (3^3)°, M- vertex regions are defined by two ways: 

(I) Geometrically, one can construct M-vertex regions by drawing the orthogonal projections to the 
edges, denoted as Rj^{y). For instance see Figure [7] with M = Mqc- 

The functional forms of Rfj{y) for M = (mi, 7712) in the basic triangle are: 

Riiiyi) = {(2;, y) (3^3) ■.x<mi; y< m.2 - {x - mi) C1/C2} , 

-RM(y2) = {{x,y) e T(3^3) ■.x>mi; y < m2 + (1 - Ci) {x - mi)/ 02} , and 

Riiiy^) = {(2;, y) e ^'(3^3) ■y>m2-ci{x~ mi)/c2; y > m2 + {1 ~ ci) (x - mi)/c2} . 

However, the orthogonal projections from M to the edges does not necessarily fall on the boundary of T (3^3). 
For example, letting be the orthogonal projection of M to edge 62, it is easy to see that might fall 

outside T (3^3) which contradicts the definition of vertex regions. In fact P2' e 62 iff — "^f^ — — < 02 

ci + 

iff C2 (C2 - m.2) + Ci(ci - JTli) > 0. 

(II) One can also construct Af -vertex regions with M E T (3^3)° by using the extensions of the Hne 
segments joining y to M for all y G 3^3. See Figure [9] with AI = Mc. The functional forms of i?7\/(yi) for 
i e {1, 2, 3} with M = (mi, m2) and mi > ci in the basic triangle, Tf,, are given by 

T) ( \ l( \r-rrfM\ ^ "^2 {x - 1) m2(ci -x) + C2(x-mi) 

■Rm Yi = i i.x,y) € r(3^3) : y < — ; y < 

mi — 1 ci — mi 

u / \ ^( ^ "^2 3; m2(ci -a;)+C2(a;-mi)1 

^^M(y2) = Ux, y) e T (3^3) : y < ; y > )■ , and 

mi ci — mi J 

„ , s ( , N „ / N rn2 X mo (x — 1) 

RAiiys) = {x,y) € T(3^3) : 2/ > y > — — 

1^ mi mi — 1 

For mi < ci, i?A/(yi) for i G {1, 2, 3} are defined similarly. 



16 



If X falls on the boundary of two M- vertex regions, then x is arbitrarily assigned to one of the Af -vertex 
regions. 

To distinguish between these two types, the vertex regions constructed by using orthogonal projections 
are denoted as R^{y) and the vertex regions constructed by using the lines joining vertices to M are denoted 
as Rm{y)- By definition, Rociv) and Rqq{y) are identical. But, for Af = Mq, Mqc, Mj, i?j\/(y) can have 
both versions, so the above distinction is necessary for them. 

7.1.2 CC- Vertex Regions 

The region Vc(y) H 2^(3^3) is a special type of vertex regions, which can also be obtained geometrically by 
starting at Mcc and drawing the orthogonal projections to the edges. Hence these regions are called CC- 
vertex regions. One can also construct CC- vertex regions by drawing the perpendicular (mid)cdge bisectors 
or by finding the circunicenter and drawing the orthogonal projections to the edges. See Figure [71 where Mj 
are the midpoints of the edges. 




Mcc 



Figure 7: The CC-vertex regions in an acute triangle (left) and in an obtuse triangle (right). 



The functional forms of RcciVi) for i £ {1,2,3}, in the basic triangle Tf, = ((0, 0), (1, 0), (ci, C2)) (see 
Section [5]) are given by 

Rcciyi) ^{{x,y)eT{y,):x<l;y< + ' ^ ^ 
L 2 2C2 

(, , , , 1 c? + - 1 + 2 (1 + Cl).T 

L 2 2C2 

Rcc{y3) = <{x,y)eT{y3):y>^ 2_ ■y> 



2C2 



2C2 



One can also define CC-vertex regions by using the line segments which join Mqc to edge and are 
extensions of the lines joining Mcc to the vertex Vi for i £ {1, 2, 3}, but this definition only works for acute 
triangles, since Mcc ^ T (y^)" for non-acute triangles. 

7.1.3 C7\/- Vertex Regions 



The motivation behind CAf-vertex regions is that unlike the circumcenter, center of mass is guaranteed to 
be inside the triangle. We define the CAf-vertex regions using the median lines and denote the regions as 
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RcAiiVi) for i £ {1,2,3} (see Figure [H]). However, the method with orthogonal projections of Mc to the 
edges does not always work. Let Pf^^ be the point at which orthogonal projection of Mc on crosses 
for i e {1, 2, 3}. Then, P^^ might fall outside T (3^3) in which case RcAiiy^) adjacent to two vertices yi 
and ya, while RcAiivs) is not adjacent to any of the vertices. Hence the definition of the vertex regions is 

violated. In fact € 62 iff o^^. t < C2 iff 2 + 2 - ci > 0. See Figure [8] for the domain of 



(ci,C2) inn forPl^^^ €62. 



3(4+4) 




Figure 8: Depicted is the domain of (ci, C2) where G 62 




Figure 9: The CM- vertex regions with orthogonal projections (left) and with median lines (right). 
The functional forms of RcAiiYi) for * G {1, 2, 3} in the basic triangle Tf, arc given by 

RcMiy.) - I (., y) eTiy,):y< ^ < - ^ 



i?cA/(y2) = |(x-,y)er(3^3):y< 
RcM{y3) = l{x,y)eT{y3):y> 



2ci- 


- 1 


C2 (2 a; 


-1) 


2ci - 


- 1 


C2 [x - 


1). 



ci - 2 



; y< 
y> 



ci - 2 

C2 X 
1 + Cl 
C2 X 



1 + Ci 
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7.1.4 /C- Vertex Regions 



One can also define the incenter vertex regions by using the inner angle bisectors. With orthogonal projec- 
tions, /C-vertex regions are bounded by the edges of r(3^3) and the inradii crossing the tangential points 
of the incircle on the edges. These three regions are denoted as R-j-(j{y) for y G 3^3. With the inner angle 
bisectors, the incenter Mj is used and the parts of the inner angle bisectors that join Mj to the edges. These 
vertex regions are denoted as i?/c(y)- See Figure [TOl for both versions of the vertex regions. 

Note that one might also use the orthocentcr. Mo, to define the vertex regions. However, for non-acute 
triangles OC-vertex regions cannot naturally be defined. 




Figure 10: The /C-vertex regions with orthogonal projections (left) and with extension of the line segments 
joining the vertices to Mjc (right). 



7.2 Edge Regions 

The spherical proximity region seen earlier is constructed by using the vertex region based on the closest 
vertex, argmin^g-y^ d{x, y). One can also use the closest edge, argmin^gij^ 2,3} d{x, e^), in defining a proximity 
region, which suggests the concept of edge regions. 

While using the edge argmin^g^-]^ 2 3}d{x,ei), the triangle is again partitioned into three regions whose 
intersection is some point M with Euclidean distance to the edges d(Af, ei) = d{M, 62) = d{M, 63), so M is 
in fact the incenter of T (3^3) and d{M, e) = r^c is the inradius (see Section [5] for incenter and inradius). 

Definition 7.2. The connected regions that partition the triangle, T(3^3), in such a way that each region 
has one and only one edge of T (3^3) on its boundary, are called edge regions. □ 

This definition implies that there are exactly three edge regions which intersect at only one point. M in 
T(3^3)°, the interior of T(3^3). In fact, one can describe the edge regions starting with M. Join the point 
M to the vertices by curves such that the resultant regions satisfy the above definition. Such regions are 
called M-edge regions and the edge region for edge e is denoted as i?j\/(e) for e € {ei, 62, 63}. Unless stated 
otherwise, M-edge regions will refer to the regions constructed by joining M to the vertices by straight lines. 
In particular, one can use a center of T (y^) for the starting point M. One can also consider the points in 
-RM(e) to be "closer" to e than to the other edges. Furthermore, it is reasonable to require that the area of 
the region Rm{&) get larger as (i(M, e) increases. Moreover, in higher dimensions, the corresponding regions 
are called "face regions" . 

The functional forms of RMi&i) for i G {1,2,3}, for M = (7711,7712) G T {y^)° and 7711 > ci in the basic 
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triangle are given by 



n . X ]/ ^m2ix-l) C2 (a; - mi) - ma (x - ci) 

RM(ei) = < {x,y) eT{y3) : y > — —; y > 



mi — 1 



ci — mi 



„ / V I , s „ , , ma a; ca (x — mi ) — m? (a; — ci ) , 

62 = e TCV3 : y > 2/ < } , and 

' mi Ci — mi 

RAiiea) = i (a;,?/) e T(3^3) : ; y < — 

mi mi — 1 



If X falls on tlie boundary of two M-edge regions, then x is arbitrarily assigned to one of the M-edge 
regions. 

The center of mass edge regions (CM-edge regions) are described in detail, as we will use them in defining 
a new class of proximity maps. 

7.2.1 CM-Edge Regions 

One can divide T (3^3) into three regions by using the median lines which intersect at the centroid, or 
equivalently, joining the centroid Mc to the vertices by straight lines will yield the CM-edge regions. Let 
Rcmie) be the region for edge e £ {ei, 62, 63}. See Figure [TT] (left). 

The functional forms of RcKii^i) for i € {1, 2, 3}, in the basic triangle. Tf,, are given by 

„ . N f/ X ^/ N €2(1 — 2 x) C2 (1 —■ x) 

RcMiei)^ \{x,y)eT{y3):y< ^ ^„ ;y> ^ ^ 



1 - 2ci 



2-ci 



RcMie2) = < (x, y) eT{ys):y> ^^.LM; y > x 



1 - 2ci 



1 + Cl 



RcAiies) = { ix,y) eT{y,):y< y < £iiL^ 

' 1 + ci 2 — ci 




Figure 11: CM-edge regions RcM{^i) (right) and Ric{ei) (left) for i e {1,2,3}. 

Remark 7.3. One can also divide T (y^) into three regions by using the inner angle bisectors which inter- 
sect at the incenter, yielding the incenter edge regions (/C-edge regions). Let Ric{e) be the region for 
edge e € {61,62,63}. Notice that the closest edge to any point in i?/c(e) is edge e, i.e., x e Ric{e) iff 
argmin^gjgj^ g^jd(x,u) = e. If two edges are equidistant from x, x is arbitrarily assigned to an edge 
region. See Figure [TT] fright). □ 
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Remark 7.4. In R, one can view the end points of [0, 1], {0, 1}, as vertices or edges. So [0, 1/2] and [1/2, 1] 
can be viewed as either vertex regions or edge regions. □ 



8 Proximity Regions in Delaunay Tessellations 



Let y„i = {yi, . . . , Ym} be m points in general position in K.'' and Tj be the j*'* Delaunay cell for j = 1, . . . ,J. 
Let also that A'„ be a random sample from F with support S{F) C Ci/(3^m). That is, = CH(3^m) and the 
flj = Tj with /i being the Lebesgue measure. Then the appealing properties for proximity regions in Section 
m become: 

PI N{x) is well defined for aU x e Cuiym)- 

P2 a; e N{x) for all x G Ci/(3^,„). 

P3 a: is at the center of N{x) for all x e C_ff(3^Tn)- 

P4 For X E Tj C CH{ym), N{x) and Tj are of the same type; they are both {d + l)-simplicies. 

P5 For x G C C^f (3^m), N{x) mimics the shape of Tj] i.e., it is similar to Tj. 

P6 Conditional on X E Tj, N{X) is a proper subset of Tj a.s. 

P7 For X £ Tj and y E Tk with 7 7^ fc, A^(a;) and are disjoint a.s. 

P8 The size of N{x) is continuous in x; that is, for each £ > there exists a 6{e) > such that |/i(iV(y)) — 
^{N{x))\ < 5(e) whenever ||?/ — x|| < e. 

P9 The arc probability n{Ns) does not depend on the support region for uniform data in R'*. 

In particular, for illustrative purposes, we focus on R^, where a Delaunay tessellation is a triangulation, 
provided that no more than three points of are cocircular. Furthermore, for simplicity, let 3^3 = {yi, y2, Ys} 
be three non-coUinear points in R^ and ^(3^3) = T{\/i,\/2,yz) be the triangle with vertices 3^3. Let Xn be 
a random sample from F with support S(F) ^ T (y? ,) - The spherical proxim i ty ma p is the first p r oximit y 



A ranuom sample nom r wiin supporu o\r } 1 -^ne spuericai proxmmy ma p is une msu p r oxmnu y 

map defined in liter ature fsee DeVinnev et al.l (|2002[). iMarchette and Priebe (|2003[ ). |Priebe et all (|2003al ). 



Priebe et al.l ( 2003b ). and DeVinnev and Priebd l 20061 ) ) where CC- vertex regions were implicitly used for 
points in Ch (3^m)- In the following sections, we will describe arc-slice proximity maps Nas{') and define 
two families of triangular proximity regions for which P4 and P5 will automatically hold. 



8.1 Arc-Slice Proximity Maps 

Recall that for Ns{ ) P7 is violated, since for any x E Tj EL W^, B{x,r{x)) (f. Tj, which implies that 
two proximity regions Ns{x) and Ns{y) might overlap for x,y in two distinct cells. Such an overlap of 
the regions make the distribution of the domination number of the PCD associated with Ng{-), if not 
impossible, hard to calculate. In order to avoid the overlap of regions B{x,r(x)) and B{y,r{y)) for x, y in 
different Delaunay cells, the balls are restricted to the corresponding cells, which leads to arc-slice proximity 
regions, Nas{x) ■= B{x,r{x)) r\Tj, where B{x,r{x)) is the closure of the ball B{x,r{x)). The closed ball is 
used in the definition of the arc-slice proximity map for consistency with the other proximity maps that will 
be defined on Delaunay cells. The arc-slice proximity map Nas{x) is well-defined only in Ch (3^m), provided 
that ym is in general position and m > d + 1 in R''. 

By construction, the CC-vertex regions are implicitly used, since x E Rcciy) iff y = argmin^g-y^ d{x, u). 
To make this dependence explicit, the notation Nas{', Mcc) is used. See Figure [T^ (top) for Nas{x, Mac) 
for an a; G Rcciy 2)- The functional form of NAsix, Mcc) for an x G Rcc{y) is given by 

Nas{x,Mcc) := {z eTiys) : d[z,x) < r{x) = d(x,y)}. 
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Notice that, the region Nas{x, Mcc) is a closed region, unHkc Ns{x) = B{x, r{x)). The properties PI, P2, 
P7 hold by definition. Notice that NAsix, Mcc) C T (3^3) for all a; G T (3^3) and Nas{x, Mcc) = T (3^3) iff 
X = Mcc, since B{x,r(x)) D ^'(3^3) only when x = Mcc- Hence the superset region for arc-slice proximity 
maps with CC-vcrtcx regions is I%s{Nas, Mcc) — {Mcc}- So P6 follows. Furthermore, P8 holds since 
the area A {Nas{x, Mcc)) is a continuous function of r{x) — miuy^y^ d{x,y) which is a continuous function 
of X. P3, P4, P5, and P9 fail for Nas{x,Mcc)- See Figure [T^ for the arcs based on Nas{x,Mc) for a 
realization oil X points in the one triangle case, and Figure [Ml for the arcs for the realization of 77 X points 
in the multi-triangle case in Figure [2] (top right). 




Figure 12: Nas{x,Mcc) with an x e Rcciy-i) (top) and the superset regions ^%s[Nas,Mc) (bottom left) 
!^.s{Nas,Mic) in T (3^3) (bottom right). 

One can define arc-slice proximity regions with any type of M-vertex regions as 

Nas{x,M) ■.= ^{x,r{x))C^T{y^) where r{x) := (i(.T,y) for x e Rm{)/)- 

But for M ^ Mcc-, Nas{-,M) satisfies only PI, P2, and P7. P6 fails to hold since 3is{NAS-,M) has 
positive area and P8 fails since the size of Nas{xtM) is not continuous in x. See, for example. Figure 
[12] (right) for 3£s{Nast Mc)- In terms of the properties in Section IH Nas{',Mcc) is the most appealing 
proximity map among the family ,jVas '-= {NAsi', ^^) '- M €R'^ \ 3^3}. 

Moreover, Ao{Nas, M) = 3^3 for all M €R^\ 3^3 since X{Nas{x, M)) = iff a; G ^3- 
Next, we define triangular proximity regions^ which, by definition, will satisfy properties P4 and P5. 
These proximity regions are the building blocks of the PCDs for which more rigorous mathematical analysis 
— compared to the PCDs based on spherical and arc-slice proximity maps — will be possible. 
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Figure 14: The arcs for arc-slice PCDs with N^'^{x, Mc) for the 77 X points that he in the Ch (3^io) (see 
Figure [2]). 



8.2 Proportional-Edge Proximity Maps 

The first type of triangular proximity map introduced is the proportional-edge proximity map. For this 
proximity map, the asymptotic distribution of domi nation number and the re l ative density o f the c orre- 
sponding PCD will have mathematical tractability (jCevhan and Priebd (|2005[) . ICevhan et all (|2006l) and 
Cevhan and Priebd (|2007t )). 



For r £ [l,c»], define Np^[-,M) := N {■ , M]r,y^) to be the proportional- edge proximity map with M- 
vertex regions as follows (see also Figure [TSl with M = Mc and r = 2). For x €T (3^3) \ 3^3, let v{x) S 3^3 be 
the vertex whose region contains X] i.e., x G RMiv{x)). If x falls on the boundary of two M-vertex regions, 
v{x) arbitrarily assigned. Let e{x) be the edge of T(3^3) opposite v{x). Let i{v{x),x) be the line parallel 
to e(x) through x. Let d{v{x)^ £{v(x), x)) be the Euclidean (perpendicular) distance from v{x) to i{v{x)^x). 
For r e [1, 00), let £r{v{x), x) be the line parallel to e{x) such that 

d{v{x), £r{v{x), x)) = r d{v{x), £{v{x), x)) 
and 

d(£{v{x), x), £riv{x), x)) < d{v{x), £riv{x), x)). 



Let Tr{x) be the triangle similar to and with the same orientation as T (y^,) having v{x) as a vertex and 
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£r{v{x), x) as the opposite edge. Then the r-factor proportional-edge proximity region Np^{x, M) is defined 
to be Tr(x)nT (3^3). Notice that €(w(a;), a;) divides the edges of Tr(a;) (other than ^^(^(a;), x)) proportionally 
with the factor r. Hence the name proportional edge proximity region and the notation Np^[-, M). 

Notice that ?- > 1 implies x £ Np^{x, M). Furthermore, linir^oo Np^{x, AI) = T (y-s) for all x E 
Tiya) \ ys, so NP'e{x,M) := T(3^3) for all such x. For x S 3^3, iV^£;(a;,M) := {a;} for all r G [l,oo]. See 
Figure [TBI for the arcs based on Np^{x, Mc) in the one triangle and the multi-triangle cases. 

Notice that Xi ~^ F, with the additional assumption that the non-degenerate two-dimensional pdf / 
exists with support S{F) C T{yz), implies that the special case in the construction of Np^ — X falls on 
the boundary of two vertex regions — occurs with probability zero. Note that for such an F, Np^{X, M) is 
a triangle a.s. 




Figure 15: Construction of proximity region. Np^(x) (shaded region) for an x G RcAiiyi)- 




Figure 16: The arcs for Np~j^{x, Mc) the 7 X points in Figure [T3l and the arcs for Np~j^{x, Mc) for the 77 
X points that lie in the Ch (3^io) in Figure [21 

The functional form of Np^{x, M) for x ~ (xq, yo) E Th is given by 
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N' 



PE 



N' 



PE 



(a;, M) ^{ix,y)eT,:y< !:(M£i^£l(2i±l)K£2±£l£ | for a; G i?M(y2), 



A^^£(a;,M) = £ : y > 2 yo - C2(r - 1)} for x £ i?A/(y3). 

Of particular interest is Np^{x, M) with any A/ and r <E {\/2, 3/2,2}. For r = \/2, divides 

T^{x) into two regions of equal area, hence N^{x, M) is also referred to as double-area proximity region. 
For r = 2, £{v{x), x) divides the edges of T2{x) — other than £r{v{x),x) — into two segments of equal length, 
hence Np^{x, M) is also referred to as double-edge proximity region. For r < 3/2, lMs{Npp;, Mc) = 0, and 
for r > 3/2, Ms^N^pp,, Mc) has positive area; for r = 3/2, ^s(iV^^,Mc) = {Mq}. Therefore, r = 3/2 is 
the threshold for Np^{-,Mc) to satisfy P6. Furthermore, r = 3/2 is the value at which the as ymptotic 
distrib utior i of the domination numbe r of the PCD based on Np^{-, Mc) will be nondegcnerate (see lCevhanI 
(|2004 and lCevhan and Priebd (l2005l )). 

The properties PI, P2, P4, P5, and P7 follow by definition for all M and r. Furthermore P9 holds, 
since Npp< is geometry invariant for uniform data. Property P5 holds with similarity ratio of Npp,{x, M) to 



7^(3^3): 



min(d(u(x), e{x)),r d{v{x), £{v{x), x))) 



that is, Np^{x, M) is similar to T (3^3) with the given ratio. 



e{x)) 

P6 holds depending on the pair M and r. That is, there exists an :— ro{M) so that Np°^{x, M) satisfies 
P6 for all r < ro(A/), and fails to satisfy otherwise. P6 fails for all M when r ~ 00. P8 holds only when 
M = Mc- With CM-vertcx regions, for all r G [1, 00], the area A {Npp.{x, Mq)) is a continuous function of 
d{£r{v{x), x), v{x)) which is a continuous function of d{l{v{x), x), v{x)) which is a continuous function of x. 

Moreover, Ko{Npp,,M) — for all r e [l,oo] and M e \ 3^3, since the R^-Lebesgue measure 
A(iV|,£(.T,M)) =Oiff a;e3^3. 

As for P3, for T^i^x) C r(3^3) one can loosen the concept of center by treating the line t{v{x),x) as 
the edge-wise central line, so P3 is satisfied in this loose sense for r = 2. Notice that x is not the unique 
center in this sense but a point on a central line. Let Mi, i e {1,2,3}, be the midpoints of the edges of 
T (3^3), and T{Mi, M2, M3) be triangle whose vertices are these midpoints. Then for any x € T{Mi, M2, M3), 
7V|,£;(x, M) = T (3^3), so r(Mi, M2, M3) C ^s{NIe,M) where equality holds for M = Mc for all triangles 
and for M = Mcc in non-obtuse triangles (see Figure flTl (left)). 




Figure 17: Superset region^s {^pe^ ^^cc) in an acute triangle (left), superset region, ^s{Npp,, Mj) (right) 

For an example of double-edge proximity regions Npp,{x,M) with CC-vertex regions with orthogonal 
projections, see Figure [T51 (top left). Notice that we use the vertex closest to x; i.e., argmiUyg-y^ d{x,y) for 
Npp,{x,CC), i.e. vertex regions Rcc{-)- Furthermore, if x is close enough to M, it is possible to have 
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N^e{x,M) = r(3^3). Sec Figure [m (bottom) for an 
projections. 



example with CC-vertex regions with orthogonal 




O 0.2 0.4 o.e 0.8 1 O 0.2 0.4 o.e O.S 1 

0.8 -| 




Figure 18: Shaded regions are double-edge proximity regions Np^{x,M) C T [y^] with M — Mcc for an 
X £ Ftcciy'^) (top left), with M = Mj for an a; S Rj-^{y2) (top right). Depicted in the bottom is an example 
of iV|,£;(x, M) = T (3^3) wfth M = Mcc for an x G Rcciy^)- 

In non-obtuse triangles, (^pb, Mcc) = r(Aii, M2, M3). But, in obtuse triangles, {NpE,Mcc) 2 
T{Mi, M2, M3) and is a quadrilateral. The functional form of the superset region, Ms {Np-^, M), in Ti, is 
given by 



,s {N'PE.M) ^{{x,y)^RM{yi):y> 

C2 (r (x-l) + l) 



(x,?/) e i?M(y2) : y > 



r ci 



|j|(a;,y) e i?M(y3) :y< 



r - 1 



C2- 



and the functional form of T{Mi, M2, M3) in is given by 



rrfnr nj i\f \ )i \ ^ rp ^ C2 C2 (-1-^2 a;) C2 (1 - 2g;) 

2 2 ci 2(1 — ci) 



Let {Npp, M) be the superset region for Npp based on M- vertex regions with orthogonal projections. 
See Figureinifor the superset region M^ {Nip, Mj). Again r(Mi, M2, M3) C .^^{Nlp,Mi) for aU T (3^3) 
with equality holding when T (3^3) is an equilateral triangle. For Nlp{-, Mc) constructed using the median 
lines Ms {N"lp,Mc) = T(Mi,M2,M3) and for Nlp{-,Mc) constructed by the orthogonal projections, 
{Nip, Mc) 2 r(Mi, M2, M3) with equality holding when T (3^3) is an equilateral triangle. An example 
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of double-edge proximity regions is given in Figure [15] (top right) where JC-vertcx regions with orthogonal 
projections to the edges is used. We could also use /C-vcrtex regions obtained by inner angle bisectors. Note 
also that the superset region ^^(iV|,^,M/) is as in Figure [TT] Again r(Mi,M2,M3) C M^{NIj^,Mi) for 
aU T (J^'a) and r(Mi, M2, M3) = .^gi^PE^ ^^i) ^ (3^3) is an equilateral triangle. 

For r — v^, one can loosen the concept of center by treating the line i{v{x),x) as the area-wise central 
line in N^{x, AI), so P3 is satisfied in this loose sense. For an example of N^{x,M) with CC-vertex 
regions with orthogonal projections, see Figure fT9l (top left). Ms (^N^, Mcc^ has positive area; see Figure 

1201 An example of double-area proximity region with /C-vertex regions is given at Figure [19] (top right) 
with orthogonal projections to the edges. Note that if x is close enough to Af , it is possible to have 
N^{x, M) = T {y^). See Figure [T91 (bottom) with CC-vertex regions with orthogonal projections. We 
could also use /C-vertex regions obtained by inner angle bisectors. 




o 0.2 0.4 o.e 0.8 1 



Figure 19: Shaded regions are double-area proximity regions Np^{x, M) C r(3^3) with M = Mcc for an 
x e Rcciy^) (top left), with M = AIj for an x G Rjc{y2) (top right) and N^{x,M) = ^(^3) for an 
X e R^(j{y2) with M = Mcc (bottom). 

Note also that N^{x, Mi) ~ T (y^) might occur if x is close enough to Mj when Mj ^ 3"^. Let 
/ij be the altitude of r(3^3) at vertex y^, for j = 1,2,3. If r^c < (^^) foi' some j G {1,2,3}, 
then Sf.s (^N^, Mj^ has positive area. See Figure where the superset region is barely noticeable. If 
fie > ("^^^7^) ™axjg{i_2,3}{/ij}, then Ms [n^, Mj^ has zero area. In T^, r^c > max{/ii, /13} always 

hold, but r,c > (^) h2 holds iff cf + c^ < (V2- l)' (l + ^ (l - ci)'^ + iff jeal < (V2- l) (l + |ei|). 

In T (3^3), drawing the lines qi{r, x) such that d{yi, e;) r d{qi{r, x), yi) for i e {1, 2, 3} yields a triangle, 
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Figure 20: The superset region, (^N^,M^ with M = Mcc (left) and M = Mj (right). 

for r < 3/2 . See Figure [21] for with r = \/2. The functional form of in Tb is 
^ = <(x,y) eTt : y > — i y < — y < — } = T (ti(r),t2(r) ,tz{r)) 



= T 



r ' r(l — ci) ' rci 

(r-l)(l + ci) C2(r-1)\ /2-r + ci(r-l) C2(r-1)\ /ci(2-r)+r-l Ca (r - 2) 



(1) 



There is a crucial difference between and r(A/i, Afa, M3): T{Mi,M2,M'i) C ^5(iV];^,Af) for all A/ 
and r > 2, but (J^'')" and Ms (N^e.M) are disjoint regions for aU M and r. So if A/ e (5^' )°, then 
MsiN^E^M) = 0; if A/ e (9(3^0, then ^s(iV^£,Ar) = {Af}; and if Af ^ ^^ then ^s(iV|,^,Af) has 
positive area. Thus Np^i-, A/) fails to satisfy P6 if M ^ 3"^ . The triangle 3"^ defined above plays a crucial 
role in the analysis of the distribution of the domination number of the proportional-edge PCD. In fact, it 
has been shown that for M £ \t\(r\t2(r\t-iir)\ there ex ists a specific value of r for which the asymptotic 
distribution of the domination number is non-degenerate ( Cevhan and Priebe ( 2007h ). The superset region 
Ms {NpEi M) will be important for both the domination number and the relative density of the corresponding 
PCDs. 




Figure 21: The triangle 

In non-acute triangles, the above condition holds for M ~ Mcc: since in right and obtuse triangles, 

Mcc i T{y:i)° ^ Mcc i (since T {y:^) D .3^). For an acute basic triangle, if y^c < "'^ 1^"^^"'' 

2 (1 - ci) 
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holds, then Mcc ^ '"^^ ^ and the superset region for such triangles is the triangle with vertices Mcc, 

— — ro — — — I _ anrl I — 

2ci 




Remark 8.1. In terms of the properties stated in Section [3J Np^{-, Mq) is the most appealing proximity 
map in the family := {Np^{-, M) : r G [1, oo], M € \ 3^3}. It is also noteworthy that the asymptotic 

distribution of the domination number of the PCD based on Np^^{-,Mc) is nondegcnerate. □ 

Remark 8.2. 

• For n < r2, N^p'e{x, M) C N''p%{x, M) for all a- G T (3^3). For n < ra, iVp^(a;, M) C N];-p,{x, M) with 
equality holding for only 2; G 3^3 or x G (a^p]j"''''''''\ . 

• For 3/2 < n < r2, (iVp^, M) C Ms {N^p'e^ M) and (^pb, Af) = for r < 3/2. 



For ri < ra, A(A^p£(X,M)) A(7Vp^(X,M)) for X from a continuous distribution on T (3^; 

where <'^-'" stands for "stochastically smaller than". □ 



3) 



8.2.1 Extension of Np^ to Higher Dimensions 

The extension to M.'^ for d > 2 is straightforward. The extension with M = Mc is given her, but the 
extension for general M is similar. Let 3^d+i = {yi, y2, • • • , Vd+i} be d + 1 points that do not He on the same 
d— 1-dimensional hyperplane. Denote the simplex formed by these d+1 points as (S(3^d+i). A simplex is 
the simplest polytope in M.'^ having d+1 vertices, d{d+l)/2 edges and d+1 faces of dimension {d— 1) . For 
r G [1,00], define the proximity map as follows. Given a point x in (S(3^d+i), let v :~ argmin^g-y^^^ Y{Qy{x)) 
where Qy{x) is the polytope with vertices being the d{d + l)/2 midpoints of the edges, the vertex v and 
X and V(-) is the d-dimensional volume functional. That is, the vertex region for vertex v is the polytope 
with vertices given by v and the midpoints of the edges. Let v{x) be the vertex in whose region x falls. If 
X falls on the boundary of two vertex regions, v{x) is assigned arbitrarily. Let ip{x) be the face opposite 
to vertex v{x), and T{v{x),x) be the hyperplane parallel to (p{x) which contains x. Let d{v{x),T{v{x),x)) 
be the (perpendicular) Euchdean distance from v{x) to T{v{x),x). For r G [l,oo), let Tr{v{x),x) be the 
hyperplane parallel to f^x) such that 

d{v{x), Tr{v{x), x)) = r d{v{x), T(w(x), x)) 
and 

d{T{v{x),x),Tr{v{x),x)) < d{v{x),Tr{v{x),x)). 

Let ©^(2;) be the polytope similar to and with the same orientation as & having v{x) as a vertex and 
Tr{v{x),x) as the opposite face. Then the proximity region Np^{x, Mc) '■= &r{x) n S(3^d+i)- Notice that 
7' > 1 implies x G Np^{x, Mc)- 



8.3 Central Similarity Proximity Maps 



The other type of triangular proximity map introduced is the central similarity proximity map. This will 
turn out to be the most appealing proximity map in terms of the properties in Section [H Fu r therm ore, the 



relative density of the corresponding PCD will have mathematical tractability (jCevhan et al.l (l2007f ) ). Ala s 



the distribution of the domination number of the associated PCD is still an open problem (jCevhanI ( 20041 )). 

For r G [0, 1], define N^g(-, M) := N (•, Af; r, 3^3) to be the central similarity proximity map with M-edge 
regions as follows; see also Figure [22] with M = Ale- For x £T (3^3) \ 3^3, let e{x) be the edge in whose region 
X falls; i.e., x G i?7\/(e(a;)). If x falls on the boundary of two edge regions, e{x) is assigned to x arbitrarily. 
For T G (0, 1], the central similarity proximity region Ncg{x, M) is defined to be the triangle Tt{x) with the 
following properties: 
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(i) Tr{x) has edges e[(a;) parallel to for i S {1,2,3}, and for x S i?A/(e(a;)), d{x,e'^{x)) = Td{x,e{x)) 
and d{e'^ (x) , e{x)) < d{x,e{x)) where d{x,e{x)) is the Euclidean (perpendicular) distance from x to 
e{x); 

(ii) Tt{x) has the same orientation as and is similar to T (3^3); 

(iii) a: is the same type of center of Tr{x) as M is of T (3^3). 

Note that (i) implies the parametrization of the PCD, (ii) explains "similarity", and (iii) explains "central" 
in the name, central similarity proximity map. For t = 0, NJj^'^[x,M) := {x} for all x € T'(3^3). For 
X e 9(^(3^3)), ^5s(a;,M) := {x} for all r G [0,1]. 

Notice that by definition x G NQg{x, M) for all a; G T (3^3). Furthermore, r < 1 implies that N]jg{x, M) C 
T(3^3) for all X G T(3;3) and il/ G T(3^3)°. For all x G T(3^3)° n i?M(e(a;)), the edges e^{x) and e(a:) are 
coincident iff r = 1. Sec Figure [23] for the arcs based on iVJg^(x,Mc) for 20 X points in the one triangle 
case. 

Notice that Xi *~ F, with the additional assumption that the non-degenerate two-dimensional pdf / 
exists with support S{F) C T (3^3), implies that the special case in the construction of N^g{-) — X falls on 
the boundary of two edge regions — occurs with probability zero. Note that for such an F, iVJg(X, M) is 
a triangle for r > a.s. 



/ ^^\A 










/ / 

/ ^,.^^d{x,eX,{x)) = Td{x,e{x)) 


X) 


d{x, e(:c)}^^/\ 



63 = e{x) 



T— 1 /2 

Figure 22: Construction of central similarity proximity region, N^jg (.t, Mc) (shaded region) 



Notice that central similarity proximity maps are defined with Af-edge regions for AI G T (3^3)°. Among 
the four centers considered in Scction[6l Mc and Mi arc inside the triangle, so they can be used in construc- 
tion of the central similarity proximity map. 

With M = Mc, for x G i?cA/(e), the similarity ratio of N^^gix, Mc) to T (3^3) is d{x, e^)/d(Afc, e). 

See Figure[22]for N^^^^'\x, Mc) with e = 63 and FigureElfor iV<5iHa;, Mc) with e = 63. The functional 
form of N^g{x, Mc) for an x — (xo, ya) G i?cA/(e) is as follows: 
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Figure 23: A realization of 20 X points generated iid U{T {y^)) (left) and the corresponding arcs for 
N^C=s\x,Mc) (right). 

For X e i?cM(ei), 

Arr / ,^ N // N ^ rp ^ , f (^ \\ ^ (2/0(1 -ci)-a;oC2) + C2(T-a;) 
Ncs\^^ ^■^c) = <{x,y) eTb:y>yQ + T [yo (1 - ci)), y < ; 

yo (Cl {^+t) - Xq C2 (1 + t) + C2 {t + X) \ 

Cl I ' 



< 



For X G RcM{e2), 

(1 - t) (yo Cl - Xq C2) +C2X 



^csi^^ ^'^c) = {{x,y) eTb:y>ya+T (yo ci - C2 xo), y 



< 



2/< 



Cl 

zjo(l-ci(l+T)) + a;o C2 (1 + r) 



1-ci 
For X e RcAiies), 

(yo (t + Cl) + C2 (x - a;o))_ 



iV5s(x, Mc) = Ux,y)en:y>yoil- r); y < 



Cl 

yo (1 - Cl + r) - C2 (x - xq) 



y< 



l-Ci 



8.3.1 /C-Central Similarity Proximity Regions 

With M = A//, the similarity ratio is d{x, er)/d{Mi , e). See Figure [24l for N^=f-{x,Mi) with e = 63. The 
functional form of NQg{x, Mj) for an a; = (xo,yo) € Ric{e) is as follows: 
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0.2 0.4 o.e 0.8 



0.2 0.4 




0.2 0.4 



Figure 24: N^g{x,M) with an a; G Rm{e5) (top left); N^g{x,Mc) with an x G -Ra/c(s3) (top right); and 
^cs^i^T^ic) with an a; e Rmic{^^) (^^P right). 



If X G i?/c(ei), then 



(^^c2 + (l-ci)2 + r (1 - ci)) + C2 r (a::(, - r) 



+ (1 - ci)2 

(1 - r) (yo (1 ~ ci) + xo cz) + C2 (r - x) ^ ^ ( ^c^ + (1 - cQ^ + r ^c^ + e^) (yp ci - a-p ca) - yo r ^ c\ + 

y ^ -I 1 y — 



l-ci 



Ci Vc^ + (1-Ci)2 



If X G Ric{g2), then 



7V5s(x,A'//) = ax,y)erb:2;> 



j/o (y/c^ + c^ + rci) -rc2Xo ^ ^ (1 - r) (ci - C2 xo) + C2X . 



y < 



( VcfTcf + T v/c^ + (1 - Cl)2^ (Xo C2 - yo Cl) + v/cf+cf (yo - C2 x) 



(l-ci)V^ 
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If X G Ric{gz), then 



iV55(x, Mi) = {{x,y)en:v>vo{l~ r); y < 



y< 



yO (ci + T Vcf+cf^ + C2 (X - Xo) 
Cl 

2/0 (^1 - Cl + T ^C^ + (1 - Cl)2^ + C2 (.To - X) 



N^g{-, Ml) also satisfies all the properties P1-P9. 

Remark 8.3. For acute triangles we could use CC or OC-edge regions in central similarity proximity regions 
which will also satisfy properties P1-P9. But for obtuse triangles, P2 is not satisfied and edge regions are 
not defined in a natural manner. 

In general for Af-central similarity proximity regions, the similarity ratio is d{x, eT-{x))/d(M, e{x)). See 
Figure [Ml for 7Vp^^(a;, M) with e = 63. The functional form of N^g{x, M) for an x = (xq, yo) £ Rhiie) is as 
follows: 



If X € i?A/(ei), then 
N^si=^,M) = \ix,y)en:y> 



yo {m2 (1 - t) (ci - 1) + C2 (1 - mi)) - T m2 C2 (1 - xq) 



^ (1 - 't) (yo (1 - Cl) + Xo C2) + C2 (r - x) 

y < ^ ; y < 

1 - Cl 



^^2 (1 - Cl) + C2 (1 - mi) 

yo (ci m2 (1 - t)(1 - Cl) +mi C2 (r + ci) - ci C2 (1 + 



r mi)) + a;o (c2 (1 — mi (1 — r)) + C2 m2 (ci (1 - r) - C2)) + r C2 (m2 ci — mi C2) + (c2 m2 (1 — ci) — C2 (1 - 
mi)) a; / ci (m2 (1 - ci) - ci (1 - mi)) 



If x G i?A/(e2), then 
N^si^^M)^\{x,y)eTt,:y> 



yo (c2 rni - Cl 1712) - T m2 (c2 xq - ci yo) (1 - t) (yo ci - xq C2) +C2X 



mi C2 - Cl 7712 



y< 



Cl 



< 



yo (ci m2 - C2 mi) + T Cl (C2 (1 - mi) - m2 (1 - Cl)) + Xq {cl (-T - mi (1 - t)) + m2 C2 (t (1 - Ci) + 



Cl)) + C2 (mi C2 — m2 Cl) x (1 - ci) (ci m2 — C2 mi) 



If X G -Rj\/(e3), then 

iV5s(x, M) ^\{x,y)£n:y>yo{l~ r); y < 



yo (ci m2 (1 - r) + C2 T mi) + C2 m2 {x - xq) 



Cl m2 



y< 



yo {m2 (1 - Cl) (1 - t) + C2 r) + C2 m2 {xp - C2) 
(1 - ci)m2 



Notice that NQg{-,M) also satisfies properties P1-P9. Af-central similarity proximity regions with 
mi < Cl can be defined in a similar fashion. Furthermore, Aq {N^g{-, M)) = d{T (3^3)) for all t G (0, 1] and 
Ao (iVci°(-' M)) = T (3^3), since A {Nl.g{x)) = iff x G for i G {1, 2, 3} or r = 0. 

Remark 8.4. Among the family := {N[;g{-,M) : t G [0, 1], M G T{y3)"}, every Nl;g{-,M) with 

r G (0, 1] satisfies all the properties in Section [4] □ 

Remark 8.5. 

• For Ti < T2, Nl^six^M) C N^^s{x,M) for all .t G r(3^3). For n < T2, N^^g{x,M) C iVj^_5(a;,M) with 
equality holding only for x £ d{T (3^3)). 
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• The superset region ^5 (A^^^, M) = for r G [0, 1) and .^5 (iV5|\ M) = {M}. 

• For Ti < T2, A {N^'siX, M)) <-5^ A {N'^g{X, M)) for X from a continuous distribution on T (J^g). □ 
8.3.2 Extension of N'[.g to Higher Dimensions 

The extension of N^g to for > 2 is straightforward, the extension for M = Mc is described, the 
extension for general M is similar. Let yd+i = {yi,y2, • ■ ■ ,yd+i} be d + 1 points that do not lie on the 
same (d — l)-dimensional hyperplane. Denote the simplex formed by these d + 1 points as S(3^d+i)- For 
T € (0, 1], define the central similarity proximity map as follows. Let fi be the face opposite vertex for 
i € {1,2, . . . , {d + 1)}, and "face regions" Rcn-iifi), • ■ • , RcAii^d+i) partition 6(3^d+i) into d + 1 regions, 
namely the d + 1 polytopes with vertices being the center of mass together with d vertices chosen from d + 1 
vertices. For x € ©(J^^+i) \ 3^d+i, let ip{x) be the face in whose region x falls; x S R{ip{x)). If x falls on 
the boundary of two face regions, f{x) is assigned arbitrarily. For r G (0, 1], the central similarity proximity 
region N^g^x, Mc) = &t{x) is defined to be the simplex 6t{x) with the following properties: 

(i) &r{x) has faces (pj{x) parallel to ipi{x) for i e {1, 2, . . . , (d+1)}, and for x S RcM{<f{x)), Td{x, (p{x)) = 
d{(f'^{x),x) where d{x,(p{x)) is the Euchdean (perpendicular) distance from x to <f{x); 

(ii) &rix) has the same orientation as and similar to S(3^d+i); 

(iii) X is the center of mass of ©,-(2;), as Mc is of 6(3^d+i)- Note that r > 1 implies that a; € Nl.g{x). 
8.4 The Behavior of Proximity Regions 

In this section, we provide the conditions for x, which, if satisfied, will imply some sort of increase in the size 
of the proximity regions we have defined. Let N{-) be any proximity map defined on the measurable space Q 
with measure ^, and let {xn}'^=i be a sequence of points in fl. We say N{xn) gets larger if N{xn) ^ N(xm) 
for m > n, and N(xn) gets strictly larger if N{xn) $i N{xm) for m > n. 

In the following theorems we will assume 17 = with fj, being the M^-Lebesgue measure A and M-vertex 
regions are defined with points M eM.'^\y3. 

Theorem 8.6. For arc-slice proximity regions with M-vertex regions for an AI G K^\3^3, as d{x, y) (strictly) 
increases for x lying on a ray from y in i?j\/(y) \ Ms{NaSi M), Nas{x, M) gets (strictly) larger. 

Proof: For x,y lying on a ray from y in RRiiv) \ ^s{Nas,M), if d{x,y) < d{y,y), then B[x,r{x)) C 
B{y,r{y)), which imphes Nas{x,M) C NAsiUiM), hence Nas{x,M) gets larger as d{x,y) increases for x 
lying on a ray from y in Ruiv) \ ^s{Nas,M). The strict version follows similarly. \i x,y G i?M(y) n 
^s(^AS, M), then Nas{x, M) = Nas{.V, M) = T (3^3)- ■ 

Let £(y, x) be the line at x parallel to e{x) for x G RAiiy) where e{x) is the edge opposite vertex y. 

Theorem 8.7. For the proportional- edge proximity maps with M-vertex regions for an M G K.^ \ 3^3, as 
d(£(y, a;), y) (strictly) increases for x G i?j\f (y) \ I^s iNpE^ ^'^)' ^pe{x, M) gets (strictly) larger for r < 00. 

Proof: For x, y e i?M(y) \ {N^e^ M), if d(£(y, x),y) < d{l{y, y), y), then by definition iV|,^(a;, M) C 
Np^{y,M), hence the result follows. The strict version follows similarly. lix,y & i?M(y) <^ {Np^, M), 
then 7V-^(x, M) = N-p^{y, M) = T (3^3), and if r = 00 and x, y G T (3^3) \ 3^3, iV^sl^. ^) = ^Pis(2/> M) = 
T(3^3)." 

Note that as d{£{y,x),y) increases for x G -Rj\f(y), d{£{y,x),M) decreases, provided that M G T'(3^3)° 
and M-vertex regions are convex. 

We define the M-edge regions, i?M(e), with points M G r(3'3)°. 
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Theorem 8.8. For central similarity proximity regions with M-edge regions for an M G T (3^3)°, as d[x, e) 
(strictly) increases for x G RM{e), the area A {Nq^^x, M)) (strictly) increases for t G (0, 1]. 

Proof: For x, y £ RM{e) and r G (0,1], if d{x,e) < d{y,e) then the similarity ratio of NQg{y,M) 
to T (y^) is larger than or equal to that of N^g{x, M), which in turn implies that A{N^g{x, M)) < 
A{N^g{y, M)). The strict version follows similarly. ■ 

Observe that the statement of Theorem 18. 8 1 is about the area A {Nl;g{x, M)). We need further conditions 
for N^g{x, M) to get larger. 

Theorem 8.9. Let ^M(y) be the line joining M and vertex y G 3^3. As d{x,(.M{yj)) and d{x,£M{yk)) both 
(strictly) decrease for x G Rm{ei) where j, k, I are distinct, N^g{x, M) (strictly) increases for r G (0, 1]. 

Proof: Suppose, without loss of generality, that x, y £ Rnjies). Consider the set 

S{e3,x) := {y G i?M(e3) : d{y,£M{yi)) < d{x,iM{yi)) and d{y,lM{y2)) < 4/ (y2))} , 

which a parallelogram. See Figure [55] for an example of S'(e3,a;) with M = Mc and e = 63. Given x, 
for y G S{e3,x), by construction, N^g{x, M) C N^g{y, AI). Then the desired result follows for r G (0,1]. 
Observe that if Xn+i is in S'(e3,a;„), then d{xn,£M{yi)) and d{xn, £M{y2)) both decrease. The strict version 
follows similarly. ■ 




Figure 25: The figure for x, y £ RAiies) described in Theorem 18.91 

Remark 8.10. For ^%s{Ny) with positive area, by definition, as a; ?; G ^s{Ny), Ny{x) T {y^,) and 
hence a.Ygsn^^^T^^y^■^ A{Ny{x)) G 3gs{Ny) with snv^(.T(y,) MNy{x)) = A{T (3^3)). 

• As a; Mac in a non-obtuse triangle T (3^3), Nas{x, Mcc) T (3^3) and 

argsup,gj,(3;3) ^ {Nas{x, Mcc)) = Mcc with sup A {Nas^x, Mcc)) = A{T (3^3))- 

xeTiys) 

• For r > 3/2, as x ^ y £ {<^s (N^e^M))", N^j^ix, M) ^ T (3^3) hence 

&rgsnY>,^T(y,)A{N-pE{x,M))£££s{N'pE,M) with sup A{K^pe{x, M)) ^ A{T {y^)). 

xeTiya) 

• For r < 3/2, if M ^ then as x -> M, 7V^b(x, M) ^ T (3^3) and 

argsup,g^(j;3) AiN^^^Eix, M)) = M with sup A(7V^s(x, M)) = A{T (y^)). 

x£T{ya) 
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If M e {.^'')°, then as a; ^ M, N^e{x, M) ^ N'pj^iM, M) C T (3^3), but still 

argsup,gj,(3;3) A{N-pe{x, M)) = M with sup A(iV^s(a;, M)) - A(iV^s(A/, M)). 

If Af e 9(^0, then as a; -> M, iV^s(.T, M) -> N^e{M, M) C T (3^3), and 

argsup^gj,(j;3) A(Ar^£;(a;, M)) = M with sup A(iV^£;(x, M)) = A(iV^£(M, Af)) 

which might be T (3^3) or a proper subset of T (3^3). 

• ksx^ Mc, Ny^{x, Mc) ^ T (3^3) and 

argsup,gj,(^3) A {N^^ix, Mc)) = Mc with sup A (N^^ix, Mc)) = A{T (3^3))- 

• As x ^ M, iV^^Ha;, ^ r (3^3) and 

argsup,gj,(y,) A{Nirs\x, M)) = M with sup A(iV5^i(a:, M)) = A(T (3^3)). 

For r < 1, as x ^ M, N^^gix, M) -> N]jg{M, M) and 

argsup,gj,(3;3) ^ (^£s(a;, A^)) = with sup A {N^si^, M)) = A{N^siM, A/)). □ 

Although the comments in the above remark follow by definition, they will be indicative of whether the 
asymptotic distribution of the domination number of the associated PCD is degenerate or not. 

9 Relative Arc Density and Domination Number of PCDs 
9.1 Relative Arc Density 

The relative arc density of a digraph D = (V,^) of order |V| = denoted as p{D), is defined as 

1^1 



PiD) 



n(n — 1) 



where | • | denotes the set cardinality functional ( Janson et al.l ( 2000f l). Thus p{D) represents the ratio of the 



number of arcs in the digraph D to the number of arcs in the complete symmetric digraph of order n, which 
is n{n — 1). For brevity of notation we use relative density rather than relative arc density henceforth. 

If ATi, • • • , Xn F the relative density of the associated data-random PCD D, denoted as p{Xn', h, Ny), 
is a [/-statistic, 

p{X^;h,Ny) = \ J2 E^(-^-'-^^'^y) (2) 



where 



h{X,, Xf,Ny) = I{(X„ Xj) eA} + X,) e A} 

= i{x,eNy{Xi)} + l{x,eNy{x,)}, (3) 
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where I(-) is the indicator function. Wc denote h{Xi, Xj; Ny) as hij for brevity of notation. Since the 
digraph is asymmetric, hij is defined as the nu mber of arcs in D between vertices Xi and Xj, in order to 



produce a symmetric kernel with finite variance (jLehmann 

(UMi)). 

The random variable p„ :— p{Xn',h,Ny) depends on n and Ny explicitly and on F implicitly. The 
expectation E [pn], however, is independent of n and depends on only F and Ny: 



< E [p„] = -E [hi2] < 1 for aU n > 2. 



The variance Var simplifies to 
< Var = 



-Var [hi 



2n{n — 1) n{n — 1) 

A central limit theorem for t/-statistics ( Lehmann (ligSSl) ) yields 

V^{pn - E [p„]) ^ AA(0, Gov [/li2, hi3]) 



Cov[/ii2,M < 1/4. 



(4) 



(5) 



(6) 



provided Gov [/112, /iis] > 0. The asymptotic variance of p„. Gov [ft,i2, /113], depends on only F and Ny. 
Thus, we need determine only E [/ii2] and Gov [/112, /iia] in order to obtain the normal approximation 



approx 



J\f{E [p„],Var [p„ 



,'E[/li2] Gov[/li2,/ll3]\ „ , 

: Jv I — - — , I tor large n. 



(7) 



9.2 Domination Number 

In a digraph D = {V,A), a vertex v € V dominates itself and all vertices of the form {u : vu £ A}. 
A dominating set Sd for the digraph D is a subset of V such that each vertex v G V is dominated by a 
vertex in Sd- A minimum dominating set Sn is a, domin ating set of minimum cardinality and the domination 



number ^(D) is defined as ^(D ) := \S*n\ (see, Led (1998)) where | • | denotes the set cardinality functional. See 
Chartrand and LesniakI |l996l ) and lWestj (|200l[ ) for more on graphs and digraphs. If a minimum dominating 



set is of size one, we call it a dominating point. 

Note that for |V| n > 0, 1 < 7(-D) < n, since V itself is always a dominating set. 



9.3 Asymptotic Distribution of Relative Arc Density of PCDs 



By detailed geometric probability calculations, provided in ICevhan et aU (|2006l ) and ICevhan et aU (|2007l ) 
the mean and the asymptotic variance of the relative density of the proportional-edge and central similarity 
PCDs can explicitly be computed. The central limit theorem for [/-statistics then establishes the asymptotic 
normality under the uniform null hypothesis. These results are summarized in the following theorems. 



Theorem 9.1. For r G [l,oo), 



where 



37 „2 
216 



p{r) 



1 - 



3 -2 



■A/(0,1) 



for re [1,3/2), 
for re [3/2,2), 
for r e [2,00), 



(8) 



(9) 
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and 

i.(r)==:/i(r)I(re [1,4/3)) + 1^2(0 I(re [4/3, 3/2)) + z.3(r)I(re [3/2,2)) + i.4(r)I(re [2, oo]) (10) 

with 

3007 r-^" - 13824 + 898 + 77760 - 117953 r*^ + 48888 - 24246 + 60480 - 38880 + 



58320 r4 

5467 r-^" - 37800 + 61912 + 46588 - 191520 + 13608 r'' + 241920 - 155520 + 15552 



!/i(r) 
^^2 (r) 

^ ' 233280 r4 

'^ai'r) = -[7r-^^ - 72 +312r"^ - 5332 r** + 15072 + 13704 - 139264 + 273600 - 242176 

+ 103232 - 27648 r + 8640]/[960 r% 
15r'*-llr^-48r + 25 

""'^'^ = • 

For r ~ 00, Pn{r) is degenerate. 

Theorem 9.2. For r e (0, 1], the relative density of the central similarity proximity digraph converges in 
law to the normal distribution; i.e., as n 00, 

V^(PnW_MM^AA(0,l) (11) 

where 

^(r) = rV6 (12) 



T4(6r5 - 3r4 - 25t3 + t2 +49r + 14) 

t^(T) = : r-, r-, ^ . (13) 

^ ' 45(r+l)(2r + l)(r + 2) ^ ' 

For T ~ 0, Pn{T) is degenerate for all n > 1. 

9.4 Asymptotic Distribution of Domination Number of PCDs 

Recall the triangle defined in Equation ([T]) (see also Figure [21] for J^'' with r = x/2). Let 7„(r,M) := 
7 {Xn, Np^, M) be the domination number of the PCD based on Np^ with a set of iid random variables 
from U{T (3^3)), with A/-vertex regions. 

Th e domination number 7n(f, M) of the PCD has the following asymptotic distribution ( Cevhan and Priebd 
(I2OO7I) ). As 00, 



r 2 + BER(l-p,) for r G [1,3/2) and A/ G {ti(r),t2(r),i3(r)}, 
7„(r, M) ^ < 1 for r > 3/2 and M G T (3^3)°, (14) 

i 3 forrG [1,3/2) and A/ G ^''\{ti(r),t2(r),t3(r)}, 

where — > stands for "convergence in law" and BER(p) stands for Bernoulh distribution with probability of 
success p, and <i(r) are defined in Equation and for r G [1,3/2) and M G {ti{r),t2{r),t^{r)}, 

roo ^00 g^^2 / 4r \ 

9 _ 1)2 cxp ( (lUi + 2r (r - 1) Wi Wi)\ dwsWi, (15) 



JO 



and for r = 3/2 and M = Mg = {(1/2, a/S/G)}, Pr ~ 0.7413, which is not computed as in Equation for 
its computation, see Ceyhan and Priebd (|2005[ ). For example, for r = 5/4 and Af G {ti{r) = (3/10, -\/3/10) , ^2(7') 
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Figure 26: Plotted is the probability pr — lim„^oo P {inir, M) = 2) given in Equation (|15p as a function of 
r for r £ [1,3/2) and M e {ti(r), tzW, t3(r)}. 

~ 0.6514. See Figure [26] for the plot of the numerically computed values (i.e., the values computed by 
numerical integration of the expression in Equation (jl5p ) of pr as a function of r. Notice that in the 
nondegenerate case in (fT4|) . E [7„(r, M)] = 3 — Pr and Var [7„(r, A/)] = Pr(l — Pr)- 

The results in Theorem [521 and Corollaries l5.5l and l5.6l also hold for relative arc density and the domination 
number of PCDs based Ny. That is, we have the following corollary. 

Corollary 9.3. Given any triangle To and Xn a random sample from U{To). Suppose the PCD, Do is 
defined in such a way that the ratio of the area of N{x) to the area of the triangle To is preserved under the 
uniformity preserving transformation, then the distributions of the relative arc density and the domination 
number of Do are geometry invariant. 

10 Two New Proximity Maps 

In this section, we introduce two new proximity maps and investigate their properties. 
10.1 Directional-Doubling Proximity Maps 

Without loss of generality, we can assume that T {y^) = T^. Partition the triangle T^ by M-edge regions to 
obtain RM{ei) for i = 1,2,3. For z g RMiei), directional-doubling proximity map is defined as 

Ndd{z, M) {x G Tb : d{x, e,) < 2 d{z, e,)} . 

See Figure [271 (left) for M = Mc- li z € then Ndd{z, M) := ti. Notice that if z ^ e^, then Ndd{z, M) is 
a quadrilateral. Among the properties, PI and P2 follows trivially. The line at z g RM{ei) parallel to 
divides the region into two pieces (half-way in the perpendicular direction to e^) so P3 holds in this special 
sense. P4 and P5 both fail, since Ndd{z, M) is a quadrilateral. P6 holds if M e r(A/i, M2, M3), otherwise 
it fails since ^si^DD, M) will have positive area. P7 also follows by definition. However, P8 holds only 
when M = Mc- 

Property P9 follows for Ndd, since Nod{z, M) is constructed with the boimdary of T (3^3) and parallel 
lines to the edges, by Corollarv l5.51 geometry invariance for uniform data follows. That is, the distributions 
of relative arc density and the domination number of the corresponding PCD do not depend on the geom- 
etry of the triangle r(3^3). Hence, it suffices to compute them for the standard equilateral triangle only. 
Furthermore, Kq{Ndd) = d{T (J^'g)) since Ndd{x, M) has zero area iff x g d{T (3^3)). 
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Figure 27: An example of directional-doubling proximity region (left) and double-X proximity region (right). 



10.2 Double-X Proximity Maps 

Without loss of generality, we can assume that T (3^3) = Tf,. Partition the triangle Tf, using the vertical line 
at M3 = (1/2,0) as in Figure [27l(right). Let i?i := {{x,y) e Tb : x < 1/2} and R2 := {ix,y) € Th : x > 1/2}. 
If {x,y) G Tf, with x = 1/2, assign {x,y) arbitrarily to one of Ri or R2. Wc define the double-X proximity 
map as follows. For z ~ (xo, yo) G Tfa \ {yi, y2} 

~ \{{x,y)en:l-x<2{l-Xo)] if z e R2. 

If z = {xo,yo) e {yi,y2}, then Ndx{z) ■= {z}. See also Figure [27l (right). Notice that if z ^ {yi,y2}, then 
Ndd{z, M) is a right triangle or a quadrilateral. Among the properties, PI and P2 follows trivially. The 
vertical line at z divides the region into two pieces (half-way along the x-coordinate) so P3 holds in this 
special sense. P4 and P5 fails to hold, since Ndd{z, M) may be a quadrilateral for some z £ Tj,. P6 holds 
if M = (1/2, y), otherwise ^si^DX, M) has positive area. P7 also follows by definition. However, P8 holds 
only when the regions i?i and R2 are constructed at a point where the vertical line divides the area into two 
equal pieces. 

Property PIO fails, since Noxiz) is constructed with the boundary of T [y^i) and a line with a specific 
angle (perpendicular to the largest edge), by Corollary (521 geometry invariance for uniform data does not 
hold. That is, the distributions of relative arc density and the domination number of the corresponding 
PCD depend on the geometry of the triangle ^(J^a). Hence, it does not suffice to compute them for the 
standard equilateral triangle only, but instead one should compute them for each pair of (ci, C2). Moreover, 
Ao(-/V_Dx) = {yi,y2} since Ndx{x,M) has zero area iff x € {yi,y2}- 



11 Discussion and Conclusions 

In this article, we discuss the construction of proximity catch digraphs (PCDs) in multiple dimensions. PCDs 
are a special type of proximity graphs which have applica tions in various field s. The class cover catch di- 
graph (CCCD) is the first type of PCD family in literature (jPriebe et al.l (|200ll) ) which is based on spherical 



proximity maps and has "nice properties" for uniform data in M, in the sense that, the exact and asymp- 
totic distribution of the domination number for CCCDs is available for one-dimensional uniform data. We 
determine some of the properties of the spherical proximity maps in M (called appealing properties), and use 
them as guidelines for extending PCDs to higher dimensions. We also characterize the geometry invariance 
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for PCDs based on uniform data. Geometry invariance is important since it facilitates the computation of 
quantities (such as relative arc density or domination number) related to PCDs. 

We discuss four PCD families in literature and introduce two new PCD families in this article. We 
investigate these PCD families in terms of the appealing properties and in particular geometry invariance 
for uniform data. We provide the asymptotic distribution of relative arc density and domination number 
for some of the PCD families. These tools have applications in spatial point pattern analysis and statistical 
pattern classification. Wc have demonstrated that the more the properties are satisfied, the better the 
asymptotic distribution of relative density. Furthermore, the availability of the asymptotic distribution of 
domination number also is highly correlated with the number of properties satisfied. 

The spherical proximity regions were defined with (open) balls only, whereas the new proximity maps are 
not based on a particular geometric shape or a functional form; that is, the new proximity maps admit any 
type of region, e.g., circle (ball), arc slice, triangle, a convex or nonconvcx polygon, etc. In this sense, the 
PCDs are defined in a more general setting compared to CCCD. On the other hand, the types of PCDs we 
introduce in this article arc well-defined for points restricted to the convex hull of 3^,n, (3^m)- Moreover, 
the new families of proximity maps wc introduce will yield closed regions. Furthermore, the CCCDs based 
on balls use proximity regions which are defined by the obvious metric, while the PCDs do not suggest an 
obvious metric. 

The mechanism to define the proximity maps provided in this article can be also used for defining new 
(perhaps with better properties) proximity map families. 
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